2024-11-01T16:44:50.7772398Z Current runner version: '2.320.0' 2024-11-01T16:44:50.7777739Z Runner name: 'pytorch-rocm-hw-31' 2024-11-01T16:44:50.7778525Z Runner group name: 'linux.rocm.gpu.group' 2024-11-01T16:44:50.7779448Z Machine name: 'pytorch-rocm-hw-31' 2024-11-01T16:44:50.7793358Z Testing runner upgrade compatibility 2024-11-01T16:44:50.8435426Z ##[group]GITHUB_TOKEN Permissions 2024-11-01T16:44:50.8438951Z Contents: read 2024-11-01T16:44:50.8439806Z Metadata: read 2024-11-01T16:44:50.8440687Z ##[endgroup] 2024-11-01T16:44:50.8447358Z Secret source: Actions 2024-11-01T16:44:50.8448797Z Prepare workflow directory 2024-11-01T16:44:51.2354815Z Prepare all required actions 2024-11-01T16:44:51.2525738Z Getting action download info 2024-11-01T16:44:51.5313756Z Download action repository 'pytorch/pytorch@main' (SHA:d79c5143d8e625947b93f8821bbe181419f47dee) 2024-11-01T16:44:58.7729036Z Download action repository 'aws-actions/configure-aws-credentials@v4' (SHA:e3dd6a429d7300a6a4c196c26e071d42e0343502) 2024-11-01T16:44:59.3634248Z Download action repository 'aws-actions/amazon-ecr-login@v2' (SHA:062b18b96a7aff071d4dc91bc00c4c1a7945b076) 2024-11-01T16:44:59.8202764Z Download action repository 'pytorch/test-infra@main' (SHA:49fb39b5efb49007791d74b09885044cd64544bf) 2024-11-01T16:45:00.7057218Z Download action repository 'actions/upload-artifact@v3' (SHA:ff15f0306b3f739f7b6fd43fb5d26cd321bd4de5) 2024-11-01T16:45:01.2563553Z Getting action download info 2024-11-01T16:45:01.4257837Z Download action repository 'malfet/checkout@silent-checkout' (SHA:e07af140b3ccefc05679e3755b9db68f4ee4589c) 2024-11-01T16:45:02.0044070Z Getting action download info 2024-11-01T16:45:02.1909373Z Download action repository 'nick-fields/retry@v3.0.0' (SHA:7152eba30c6575329ac0576536151aca5a72780e) 2024-11-01T16:45:02.8126885Z Uses: pytorch/pytorch/.github/workflows/_rocm-test.yml@refs/tags/ciflow/trunk/138766 (d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea) 2024-11-01T16:45:02.8129071Z ##[group] Inputs 
2024-11-01T16:45:02.8129454Z build-environment: linux-focal-rocm6.2-py3.10 2024-11-01T16:45:02.8130566Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "distributed", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]} 2024-11-01T16:45:02.8131961Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:45:02.8132704Z sync-tag: 2024-11-01T16:45:02.8133424Z timeout-minutes: 300 2024-11-01T16:45:02.8134285Z tests-to-include: test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl 2024-11-01T16:45:02.8135155Z ##[endgroup] 2024-11-01T16:45:02.8135841Z Complete job name: linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:45:02.9368254Z ##[group]Run pytorch/pytorch/.github/actions/checkout-pytorch@main 2024-11-01T16:45:02.9368747Z with: 2024-11-01T16:45:02.9368988Z no-sudo: true 2024-11-01T16:45:02.9369252Z submodules: recursive 2024-11-01T16:45:02.9369528Z fetch-depth: 0 2024-11-01T16:45:02.9369778Z env: 2024-11-01T16:45:02.9370020Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:45:02.9370289Z ##[endgroup] 2024-11-01T16:45:02.9556712Z ##[group]Run echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2024-11-01T16:45:02.9557772Z echo "IN_CONTAINER_RUNNER=$(if [ -f /.inarc ] || [ -f /.incontainer ]; then echo true ; else echo false; fi)" >> "$GITHUB_OUTPUT" 2024-11-01T16:45:02.9578013Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:45:02.9578618Z env: 2024-11-01T16:45:02.9578865Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:45:02.9579138Z ##[endgroup] 2024-11-01T16:45:03.0126940Z ##[group]Run 
retry () { 2024-11-01T16:45:03.0127687Z retry () { 2024-11-01T16:45:03.0128567Z  $* || (sleep 1 && $*) || (sleep 2 && $*) || (sleep 4 && $*) || (sleep 8 && $*) 2024-11-01T16:45:03.0130044Z } 2024-11-01T16:45:03.0130607Z echo "${GITHUB_WORKSPACE}" 2024-11-01T16:45:03.0131345Z if [ -z "${NO_SUDO}" ]; then 2024-11-01T16:45:03.0132210Z  retry sudo rm -rf "${GITHUB_WORKSPACE}" 2024-11-01T16:45:03.0133024Z else 2024-11-01T16:45:03.0133639Z  retry rm -rf "${GITHUB_WORKSPACE}" 2024-11-01T16:45:03.0134576Z fi 2024-11-01T16:45:03.0135130Z mkdir "${GITHUB_WORKSPACE}" 2024-11-01T16:45:03.0162624Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:45:03.0163433Z env: 2024-11-01T16:45:03.0163891Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:45:03.0164478Z NO_SUDO: true 2024-11-01T16:45:03.0164947Z ##[endgroup] 2024-11-01T16:45:03.0265672Z /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-11-01T16:45:07.0381658Z ##[group]Run malfet/checkout@silent-checkout 2024-11-01T16:45:07.0382726Z with: 2024-11-01T16:45:07.0383427Z ref: d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:45:07.0384373Z fetch-depth: 0 2024-11-01T16:45:07.0385045Z submodules: recursive 2024-11-01T16:45:07.0385797Z quiet-checkout: true 2024-11-01T16:45:07.0386521Z repository: pytorch/pytorch 2024-11-01T16:45:07.0387401Z token: *** 2024-11-01T16:45:07.0387747Z ssh-strict: true 2024-11-01T16:45:07.0388054Z persist-credentials: true 2024-11-01T16:45:07.0388409Z clean: true 2024-11-01T16:45:07.0388707Z sparse-checkout-cone-mode: true 2024-11-01T16:45:07.0389067Z lfs: false 2024-11-01T16:45:07.0389365Z set-safe-directory: true 2024-11-01T16:45:07.0389682Z env: 2024-11-01T16:45:07.0389947Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:45:07.0390259Z ##[endgroup] 2024-11-01T16:45:07.1346079Z Syncing repository: pytorch/pytorch 2024-11-01T16:45:07.1347721Z ##[group]Getting Git version info 2024-11-01T16:45:07.1348473Z Working directory is '/home/pytorchci/actions-runner/_work/pytorch/pytorch' 
2024-11-01T16:45:07.1349335Z [command]/usr/bin/git version 2024-11-01T16:45:07.1349699Z git version 2.34.1 2024-11-01T16:45:07.1350920Z ##[endgroup] 2024-11-01T16:45:07.1362554Z Temporarily overriding HOME='/home/pytorchci/actions-runner/_work/_temp/21a544c3-e194-4437-9c03-8b7dd2594567' before making global git config changes 2024-11-01T16:45:07.1363737Z Adding repository directory to the temporary git global config as a safe directory 2024-11-01T16:45:07.1365176Z [command]/usr/bin/git config --global --add safe.directory /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-11-01T16:45:07.1369083Z Deleting the contents of '/home/pytorchci/actions-runner/_work/pytorch/pytorch' 2024-11-01T16:45:07.1373171Z ##[group]Initializing the repository 2024-11-01T16:45:07.1374973Z [command]/usr/bin/git init /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-11-01T16:45:07.1408395Z hint: Using 'master' as the name for the initial branch. This default branch name 2024-11-01T16:45:07.1409418Z hint: is subject to change. To configure the initial branch name to use in all 2024-11-01T16:45:07.1410250Z hint: of your new repositories, which will suppress this warning, call: 2024-11-01T16:45:07.1410776Z hint: 2024-11-01T16:45:07.1411244Z hint: git config --global init.defaultBranch 2024-11-01T16:45:07.1411708Z hint: 2024-11-01T16:45:07.1412195Z hint: Names commonly chosen instead of 'master' are 'main', 'trunk' and 2024-11-01T16:45:07.1412981Z hint: 'development'. 
The just-created branch can be renamed via this command: 2024-11-01T16:45:07.1413543Z hint: 2024-11-01T16:45:07.1413841Z hint: git branch -m 2024-11-01T16:45:07.1414555Z Initialized empty Git repository in /home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/ 2024-11-01T16:45:07.1419689Z [command]/usr/bin/git remote add origin https://github.com/pytorch/pytorch 2024-11-01T16:45:07.1448153Z ##[endgroup] 2024-11-01T16:45:07.1448749Z ##[group]Disabling automatic garbage collection 2024-11-01T16:45:07.1449780Z [command]/usr/bin/git config --local gc.auto 0 2024-11-01T16:45:07.1480780Z ##[endgroup] 2024-11-01T16:45:07.1481334Z ##[group]Setting up auth 2024-11-01T16:45:07.1486524Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2024-11-01T16:45:07.1512953Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2024-11-01T16:45:07.1764877Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2024-11-01T16:45:07.1794386Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2024-11-01T16:45:07.2043689Z [command]/usr/bin/git config --local http.https://github.com/.extraheader AUTHORIZATION: basic *** 2024-11-01T16:45:07.2076414Z ##[endgroup] 2024-11-01T16:45:07.2077470Z ##[group]Fetching the repository 2024-11-01T16:45:07.2082826Z [command]/usr/bin/git -c protocol.version=2 fetch --prune --progress --no-recurse-submodules --quiet origin +refs/heads/*:refs/remotes/origin/* +refs/tags/*:refs/tags/* 2024-11-01T16:45:10.5114957Z remote: Enumerating objects: 1053119 2024-11-01T16:45:10.5115960Z remote: Enumerating objects: 1053739, done. 
2024-11-01T16:45:10.5657101Z remote: Counting objects: 100% (620/620), done. 
2024-11-01T16:45:10.6411258Z remote: Compressing objects: 100% (316/316), done. 
2024-11-01T16:45:39.2372557Z remote: Total 1053739 (delta 404), reused 469 (delta 303), pack-reused 1053119 (from 1) 2024-11-01T16:45:50.9853909Z [command]/usr/bin/git rev-parse --verify --quiet d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea^{object} 2024-11-01T16:45:50.9899585Z d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:45:50.9917791Z ##[endgroup] 2024-11-01T16:45:50.9918889Z ##[group]Determining the checkout info 2024-11-01T16:45:50.9919987Z ##[endgroup] 2024-11-01T16:45:50.9920881Z ##[group]Checking out the ref 2024-11-01T16:45:50.9922405Z [command]/usr/bin/git checkout --quiet --force d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:45:52.3928060Z ##[endgroup] 2024-11-01T16:45:52.3929305Z ##[group]Setting up auth for fetching submodules 2024-11-01T16:45:52.3931362Z [command]/usr/bin/git config --global http.https://github.com/.extraheader AUTHORIZATION: basic *** 2024-11-01T16:45:52.3995638Z [command]/usr/bin/git config --global --unset-all url.https://github.com/.insteadOf 2024-11-01T16:45:52.4048837Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf git@github.com: 2024-11-01T16:45:52.4093378Z [command]/usr/bin/git config --global --add url.https://github.com/.insteadOf org-21003710@github.com: 2024-11-01T16:45:52.4143198Z ##[endgroup] 2024-11-01T16:45:52.4144267Z ##[group]Fetching submodules 2024-11-01T16:45:52.4149097Z [command]/usr/bin/git submodule sync --recursive 2024-11-01T16:45:52.4477504Z [command]/usr/bin/git -c protocol.version=2 submodule update --init --force --recursive 2024-11-01T16:45:52.4755213Z Submodule 'android/libs/fbjni' (https://github.com/facebookincubator/fbjni.git) registered for path 'android/libs/fbjni' 2024-11-01T16:45:52.4758633Z Submodule 'third_party/NNPACK_deps/FP16' (https://github.com/Maratyszcza/FP16.git) registered for path 'third_party/FP16' 2024-11-01T16:45:52.4761406Z Submodule 'third_party/NNPACK_deps/FXdiv' (https://github.com/Maratyszcza/FXdiv.git) registered for path 
'third_party/FXdiv' 2024-11-01T16:45:52.4763932Z Submodule 'third_party/NNPACK' (https://github.com/Maratyszcza/NNPACK.git) registered for path 'third_party/NNPACK' 2024-11-01T16:45:52.4767959Z Submodule 'third_party/NVTX' (https://github.com/NVIDIA/NVTX.git) registered for path 'third_party/NVTX' 2024-11-01T16:45:52.4772700Z Submodule 'third_party/VulkanMemoryAllocator' (https://github.com/GPUOpen-LibrariesAndSDKs/VulkanMemoryAllocator.git) registered for path 'third_party/VulkanMemoryAllocator' 2024-11-01T16:45:52.4776243Z Submodule 'third_party/XNNPACK' (https://github.com/google/XNNPACK.git) registered for path 'third_party/XNNPACK' 2024-11-01T16:45:52.4779314Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/benchmark' 2024-11-01T16:45:52.4785091Z Submodule 'third_party/composable_kernel' (https://github.com/ROCm/composable_kernel.git) registered for path 'third_party/composable_kernel' 2024-11-01T16:45:52.4790664Z Submodule 'third_party/cpp-httplib' (https://github.com/yhirose/cpp-httplib.git) registered for path 'third_party/cpp-httplib' 2024-11-01T16:45:52.4793640Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo.git) registered for path 'third_party/cpuinfo' 2024-11-01T16:45:52.4796162Z Submodule 'third_party/cudnn_frontend' (https://github.com/NVIDIA/cudnn-frontend.git) registered for path 'third_party/cudnn_frontend' 2024-11-01T16:45:52.4799772Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/cutlass' 2024-11-01T16:45:52.4804200Z Submodule 'third_party/eigen' (https://gitlab.com/libeigen/eigen.git) registered for path 'third_party/eigen' 2024-11-01T16:45:52.4808255Z Submodule 'third_party/fbgemm' (https://github.com/pytorch/fbgemm) registered for path 'third_party/fbgemm' 2024-11-01T16:45:52.4813436Z Submodule 'third_party/flatbuffers' (https://github.com/google/flatbuffers.git) registered for path 
'third_party/flatbuffers' 2024-11-01T16:45:52.4816422Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/fmt' 2024-11-01T16:45:52.4821551Z Submodule 'third_party/gemmlowp/gemmlowp' (https://github.com/google/gemmlowp.git) registered for path 'third_party/gemmlowp/gemmlowp' 2024-11-01T16:45:52.4825945Z Submodule 'third_party/gloo' (https://github.com/facebookincubator/gloo) registered for path 'third_party/gloo' 2024-11-01T16:45:52.4830421Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/googletest' 2024-11-01T16:45:52.4834675Z Submodule 'third_party/ideep' (https://github.com/intel/ideep) registered for path 'third_party/ideep' 2024-11-01T16:45:52.4839330Z Submodule 'third_party/ittapi' (https://github.com/intel/ittapi.git) registered for path 'third_party/ittapi' 2024-11-01T16:45:52.4844199Z Submodule 'third_party/kineto' (https://github.com/pytorch/kineto) registered for path 'third_party/kineto' 2024-11-01T16:45:52.4849011Z Submodule 'third_party/mimalloc' (https://github.com/microsoft/mimalloc.git) registered for path 'third_party/mimalloc' 2024-11-01T16:45:52.4853529Z Submodule 'third_party/nccl/nccl' (https://github.com/NVIDIA/nccl) registered for path 'third_party/nccl/nccl' 2024-11-01T16:45:52.4858417Z Submodule 'third_party/nlohmann' (https://github.com/nlohmann/json.git) registered for path 'third_party/nlohmann' 2024-11-01T16:45:52.4863266Z Submodule 'third_party/onnx' (https://github.com/onnx/onnx.git) registered for path 'third_party/onnx' 2024-11-01T16:45:52.4868863Z Submodule 'third_party/opentelemetry-cpp' (https://github.com/open-telemetry/opentelemetry-cpp.git) registered for path 'third_party/opentelemetry-cpp' 2024-11-01T16:45:52.4873419Z Submodule 'third_party/pocketfft' (https://github.com/mreineck/pocketfft) registered for path 'third_party/pocketfft' 2024-11-01T16:45:52.4879173Z Submodule 'third_party/protobuf' 
(https://github.com/protocolbuffers/protobuf.git) registered for path 'third_party/protobuf' 2024-11-01T16:45:52.4883947Z Submodule 'third_party/NNPACK_deps/psimd' (https://github.com/Maratyszcza/psimd.git) registered for path 'third_party/psimd' 2024-11-01T16:45:52.4889146Z Submodule 'third_party/NNPACK_deps/pthreadpool' (https://github.com/Maratyszcza/pthreadpool.git) registered for path 'third_party/pthreadpool' 2024-11-01T16:45:52.4894123Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/pybind11' 2024-11-01T16:45:52.4899570Z Submodule 'third_party/python-peachpy' (https://github.com/malfet/PeachPy.git) registered for path 'third_party/python-peachpy' 2024-11-01T16:45:52.4904897Z Submodule 'third_party/sleef' (https://github.com/shibatch/sleef) registered for path 'third_party/sleef' 2024-11-01T16:45:52.4910717Z Submodule 'third_party/tensorpipe' (https://github.com/pytorch/tensorpipe.git) registered for path 'third_party/tensorpipe' 2024-11-01T16:45:52.4977442Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/android/libs/fbjni'... 2024-11-01T16:45:53.2042513Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/FP16'... 2024-11-01T16:45:53.7728817Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/FXdiv'... 2024-11-01T16:45:54.3386916Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/NNPACK'... 2024-11-01T16:45:56.1896445Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/NVTX'... 2024-11-01T16:45:57.0250312Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/VulkanMemoryAllocator'... 2024-11-01T16:46:00.2883654Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/XNNPACK'... 2024-11-01T16:46:08.5882202Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/benchmark'... 
2024-11-01T16:46:09.5217254Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/composable_kernel'... 2024-11-01T16:46:11.4920000Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cpp-httplib'... 2024-11-01T16:46:12.3223535Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cpuinfo'... 2024-11-01T16:46:13.3345245Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cudnn_frontend'... 2024-11-01T16:46:15.1786134Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/cutlass'... 2024-11-01T16:46:17.6290632Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/eigen'... 2024-11-01T16:46:22.6701152Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm'... 2024-11-01T16:46:24.6106654Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/flatbuffers'... 2024-11-01T16:46:26.7637090Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fmt'... 2024-11-01T16:46:28.1783038Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/gemmlowp/gemmlowp'... 2024-11-01T16:46:28.9830817Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/gloo'... 2024-11-01T16:46:29.8390519Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/googletest'... 2024-11-01T16:46:31.2284852Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ideep'... 2024-11-01T16:46:31.9901533Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ittapi'... 2024-11-01T16:46:32.6755544Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto'... 2024-11-01T16:46:35.7159599Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/mimalloc'... 
2024-11-01T16:46:36.8035296Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/nccl/nccl'... 2024-11-01T16:46:37.5318859Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/nlohmann'... 2024-11-01T16:46:45.3242903Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx'... 2024-11-01T16:46:47.6398427Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp'... 2024-11-01T16:46:53.8873599Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/pocketfft'... 2024-11-01T16:46:54.5043500Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/protobuf'... 2024-11-01T16:47:03.4767825Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/psimd'... 2024-11-01T16:47:03.9831416Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/pthreadpool'... 2024-11-01T16:47:04.5854448Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/pybind11'... 2024-11-01T16:47:05.9847471Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/python-peachpy'... 2024-11-01T16:47:06.7639016Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/sleef'... 2024-11-01T16:47:07.9483515Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe'... 
2024-11-01T16:47:08.7630652Z Submodule path 'android/libs/fbjni': checked out '7e1e1fe3858c63c251c637ae41a20de425dde96f' 2024-11-01T16:47:08.7908891Z Submodule path 'third_party/FP16': checked out '4dfe081cf6bcd15db339cf2680b9281b8451eeb3' 2024-11-01T16:47:08.8143110Z Submodule path 'third_party/FXdiv': checked out 'b408327ac2a15ec3e43352421954f5b1967701d1' 2024-11-01T16:47:08.8554573Z Submodule path 'third_party/NNPACK': checked out 'c07e3a0400713d546e0dea2d5466dd22ea389c73' 2024-11-01T16:47:08.9028685Z Submodule path 'third_party/NVTX': checked out 'e170594ac7cf1dac584da473d4ca9301087090c1' 2024-11-01T16:47:08.9535995Z Submodule path 'third_party/VulkanMemoryAllocator': checked out 'a6bfc237255a6bac1513f7c1ebde6d8aed6b5191' 2024-11-01T16:47:09.7738568Z Submodule path 'third_party/XNNPACK': checked out '87ee0b46b834f67bad9025d4a82ed5654f3403d3' 2024-11-01T16:47:09.8204466Z Submodule path 'third_party/benchmark': checked out '0d98dba29d66e93259db7daa53a9327df767a415' 2024-11-01T16:47:10.0408285Z Submodule path 'third_party/composable_kernel': checked out 'cedccd59c94cb0c74e7ec0d0f6c791aed081febc' 2024-11-01T16:47:10.1131740Z Submodule path 'third_party/cpp-httplib': checked out '3b6597bba913d51161383657829b7e644e59c006' 2024-11-01T16:47:10.2251341Z Submodule path 'third_party/cpuinfo': checked out '1e83a2fdd3102f65c6f1fb602c1b320486218a99' 2024-11-01T16:47:10.2753215Z Submodule path 'third_party/cudnn_frontend': checked out '936021bfed8c91dc416af1588b2c4eca631a9e45' 2024-11-01T16:47:10.7888003Z Submodule path 'third_party/cutlass': checked out 'bbe579a9e3beb6ea6626d9227ec32d0dae119a49' 2024-11-01T16:47:11.0372885Z Submodule path 'third_party/eigen': checked out '3147391d946bb4b6c68edd901f2add6ac1f31f8c' 2024-11-01T16:47:11.1359530Z Submodule path 'third_party/fbgemm': checked out 'dbc3157bf256f1339b3fa1fef2be89ac4078be0e' 2024-11-01T16:47:11.1435473Z Submodule 'third_party/asmjit' (https://github.com/asmjit/asmjit.git) registered for path 
'third_party/fbgemm/third_party/asmjit' 2024-11-01T16:47:11.1441979Z Submodule 'third_party/cpuinfo' (https://github.com/pytorch/cpuinfo) registered for path 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T16:47:11.1444684Z Submodule 'third_party/cutlass' (https://github.com/NVIDIA/cutlass.git) registered for path 'third_party/fbgemm/third_party/cutlass' 2024-11-01T16:47:11.1447481Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/fbgemm/third_party/googletest' 2024-11-01T16:47:11.1450997Z Submodule 'third_party/hipify_torch' (https://github.com/ROCmSoftwarePlatform/hipify_torch.git) registered for path 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T16:47:11.1513445Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/asmjit'... 2024-11-01T16:47:12.5248829Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/cpuinfo'... 2024-11-01T16:47:14.4868806Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/cutlass'... 2024-11-01T16:47:16.8858611Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/googletest'... 2024-11-01T16:47:18.1176323Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/fbgemm/third_party/hipify_torch'... 
2024-11-01T16:47:18.8710390Z Submodule path 'third_party/fbgemm/third_party/asmjit': checked out 'd3fbf7c9bc7c1d1365a94a45614b91c5a3706b81' 2024-11-01T16:47:18.9837550Z Submodule path 'third_party/fbgemm/third_party/cpuinfo': checked out 'ed8b86a253800bafdb7b25c5c399f91bff9cb1f3' 2024-11-01T16:47:19.4026421Z Submodule path 'third_party/fbgemm/third_party/cutlass': checked out 'fc9ebc645b63f3a6bc80aaefde5c063fb72110d6' 2024-11-01T16:47:19.4775047Z Submodule path 'third_party/fbgemm/third_party/googletest': checked out 'cbf019de22c8dd37b2108da35b2748fd702d1796' 2024-11-01T16:47:19.5051957Z Submodule path 'third_party/fbgemm/third_party/hipify_torch': checked out '23f53b025b466d8ec3c45d52290d3442f7fbe6b1' 2024-11-01T16:47:19.6369094Z Submodule path 'third_party/flatbuffers': checked out '01834de25e4bf3975a9a00e816292b1ad0fe184b' 2024-11-01T16:47:19.6918144Z Submodule path 'third_party/fmt': checked out '0c9fce2ffefecfdce794e1859584e25877b7b592' 2024-11-01T16:47:19.7450598Z Submodule path 'third_party/gemmlowp/gemmlowp': checked out '3fb5c176c17c765a3492cd2f0321b0dab712f350' 2024-11-01T16:47:19.7846008Z Submodule path 'third_party/gloo': checked out '5354032ea08eadd7fc4456477f7f7c6308818509' 2024-11-01T16:47:19.8443748Z Submodule path 'third_party/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2024-11-01T16:47:19.8736980Z Submodule path 'third_party/ideep': checked out '41d636c2bbcea6bff0faf97cdb65a48cdde987af' 2024-11-01T16:47:19.8796401Z Submodule 'mkl-dnn' (https://github.com/intel/mkl-dnn.git) registered for path 'third_party/ideep/mkl-dnn' 2024-11-01T16:47:19.8844779Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/ideep/mkl-dnn'... 
2024-11-01T16:47:31.0583083Z Submodule path 'third_party/ideep/mkl-dnn': checked out '66f0cb9eb66affd2da3bf5f8d897376f04aae6af' 2024-11-01T16:47:31.0990266Z Submodule path 'third_party/ittapi': checked out '5b8a7d7422611c3a0d799fb5fc5dd4abfae35b42' 2024-11-01T16:47:31.2013275Z Submodule path 'third_party/kineto': checked out 'ed052ea024b9468908d558b15cd3f7584fb0f492' 2024-11-01T16:47:31.2094999Z Submodule 'libkineto/third_party/dynolog' (https://github.com/facebookincubator/dynolog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T16:47:31.2103454Z Submodule 'libkineto/third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T16:47:31.2107684Z Submodule 'libkineto/third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T16:47:31.2158384Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog'... 2024-11-01T16:47:32.1190698Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/fmt'... 2024-11-01T16:47:33.8067510Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/googletest'... 
2024-11-01T16:47:35.1591566Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog': checked out '7d04a0053a845370ae06ce317a22a48e9edcc74e' 2024-11-01T16:47:35.1671813Z Submodule 'third_party/DCGM' (https://github.com/NVIDIA/DCGM.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T16:47:35.1675920Z Submodule 'third_party/cpr' (https://github.com/libcpr/cpr.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T16:47:35.1679071Z Submodule 'third_party/fmt' (https://github.com/fmtlib/fmt.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T16:47:35.1682269Z Submodule 'third_party/gflags' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T16:47:35.1686012Z Submodule 'third_party/glog' (https://github.com/google/glog.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T16:47:35.1690119Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T16:47:35.1693748Z Submodule 'third_party/json' (https://github.com/nlohmann/json.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T16:47:35.1697533Z Submodule 'third_party/pfs' (https://github.com/dtrugman/pfs.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T16:47:35.1753641Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM'... 2024-11-01T16:47:36.5080292Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/cpr'... 
2024-11-01T16:47:37.3076135Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/fmt'... 2024-11-01T16:47:38.5467049Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags'... 2024-11-01T16:47:39.6272296Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/glog'... 2024-11-01T16:47:40.6330714Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/googletest'... 2024-11-01T16:47:41.8634847Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/json'... 2024-11-01T16:47:47.9266201Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/pfs'... 
2024-11-01T16:47:48.7365551Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM': checked out 'ffde4e54bc7249a6039a5e6b45b395141e1217f9' 2024-11-01T16:47:48.7768319Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr': checked out '871ed52d350214a034f6ef8a3b8f51c5ce1bd400' 2024-11-01T16:47:48.8299531Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt': checked out 'cd4af11efc9c622896a3e4cb599fa28668ca3d05' 2024-11-01T16:47:48.8579910Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags': checked out 'e171aa2d15ed9eb17054558e0b3a6a413bb01067' 2024-11-01T16:47:48.8631419Z Submodule 'doc' (https://github.com/gflags/gflags.git) registered for path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T16:47:48.8689775Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc'... 
2024-11-01T16:47:49.6327936Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc': checked out '8411df715cf522606e3b1aca386ddfc0b63d34b4' 2024-11-01T16:47:49.6670720Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog': checked out 'b33e3bad4c46c8a6345525fd822af355e5ef9446' 2024-11-01T16:47:49.7229858Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest': checked out '58d77fa8070e8cec2dc1ed015d66b454c8d78850' 2024-11-01T16:47:49.8375039Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/json': checked out '4f8fba14066156b73f1189a2b8bd568bde5284c5' 2024-11-01T16:47:49.8736583Z Submodule path 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs': checked out 'f68a2fa8ea36c783bdd760371411fcb495aa3150' 2024-11-01T16:47:49.9315719Z Submodule path 'third_party/kineto/libkineto/third_party/fmt': checked out '0041a40c1350ba702d475b9c4ad62da77caea164' 2024-11-01T16:47:49.9984462Z Submodule path 'third_party/kineto/libkineto/third_party/googletest': checked out '7aca84427f224eeed3144123d5230d5871e93347' 2024-11-01T16:47:50.0550479Z Submodule path 'third_party/mimalloc': checked out 'b66e3214d8a104669c2ec05ae91ebc26a8f5ab78' 2024-11-01T16:47:50.1024221Z Submodule path 'third_party/nccl/nccl': checked out 'ab2b89c4c339bd7f816fbc114a4b05d386b66290' 2024-11-01T16:47:50.2270421Z Submodule path 'third_party/nlohmann': checked out '87cda1d6646592ac5866dc703c8e1839046a6806' 2024-11-01T16:47:50.6431907Z Submodule path 'third_party/onnx': checked out 'b8baa8446686496da4cc8fda09f2b6fe65c2a02c' 2024-11-01T16:47:50.6520620Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/onnx/third_party/pybind11' 2024-11-01T16:47:50.6579794Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/onnx/third_party/pybind11'... 
2024-11-01T16:47:52.1347082Z Submodule path 'third_party/onnx/third_party/pybind11': checked out '3e9dfa2866941655c56877882565e7577de6fc7b' 2024-11-01T16:47:52.2205513Z Submodule path 'third_party/opentelemetry-cpp': checked out 'a799f4aed9c94b765dcdaabaeab7d5e7e2310878' 2024-11-01T16:47:52.2277022Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark) registered for path 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T16:47:52.2280486Z Submodule 'third_party/googletest' (https://github.com/google/googletest) registered for path 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T16:47:52.2283902Z Submodule 'third_party/ms-gsl' (https://github.com/microsoft/GSL) registered for path 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T16:47:52.2287596Z Submodule 'third_party/nlohmann-json' (https://github.com/nlohmann/json) registered for path 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T16:47:52.2291121Z Submodule 'third_party/opentelemetry-proto' (https://github.com/open-telemetry/opentelemetry-proto) registered for path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T16:47:52.2294959Z Submodule 'third_party/opentracing-cpp' (https://github.com/opentracing/opentracing-cpp.git) registered for path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T16:47:52.2298810Z Submodule 'third_party/prometheus-cpp' (https://github.com/jupp0r/prometheus-cpp) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T16:47:52.2302534Z Submodule 'tools/vcpkg' (https://github.com/Microsoft/vcpkg) registered for path 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T16:47:52.2350523Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/benchmark'... 
2024-11-01T16:47:53.1040659Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/googletest'... 2024-11-01T16:47:54.3731993Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/ms-gsl'... 2024-11-01T16:47:55.1143888Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/nlohmann-json'... 2024-11-01T16:48:01.7225477Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentelemetry-proto'... 2024-11-01T16:48:02.4519604Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/opentracing-cpp'... 2024-11-01T16:48:03.0639306Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp'... 2024-11-01T16:48:03.8323649Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/tools/vcpkg'... 
2024-11-01T16:48:12.2398361Z Submodule path 'third_party/opentelemetry-cpp/third_party/benchmark': checked out 'd572f4777349d43653b21d6c2fc63020ab326db2' 2024-11-01T16:48:12.2948107Z Submodule path 'third_party/opentelemetry-cpp/third_party/googletest': checked out 'b796f7d44681514f58a683a3a71ff17c94edb0c1' 2024-11-01T16:48:12.3258945Z Submodule path 'third_party/opentelemetry-cpp/third_party/ms-gsl': checked out '6f4529395c5b7c2d661812257cd6780c67e54afa' 2024-11-01T16:48:12.4441159Z Submodule path 'third_party/opentelemetry-cpp/third_party/nlohmann-json': checked out 'bc889afb4c5bf1c0d8ee29ef35eaaf4c8bef8a5d' 2024-11-01T16:48:12.4737273Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto': checked out '4ca4f0335c63cda7ab31ea7ed70d6553aee14dce' 2024-11-01T16:48:12.5041155Z Submodule path 'third_party/opentelemetry-cpp/third_party/opentracing-cpp': checked out '06b57f48ded1fa3bdd3d4346f6ef29e40e08eaf5' 2024-11-01T16:48:12.5352225Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp': checked out 'c9ffcdda9086ffd9e1283ea7a0276d831f3c8a8d' 2024-11-01T16:48:12.5402813Z Submodule 'civetweb' (https://github.com/civetweb/civetweb.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T16:48:12.5410449Z Submodule 'googletest' (https://github.com/google/googletest.git) registered for path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T16:48:12.5453428Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb'... 2024-11-01T16:48:14.4254514Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest'... 
2024-11-01T16:48:16.0289351Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb': checked out 'eefb26f82b233268fc98577d265352720d477ba4' 2024-11-01T16:48:16.0879545Z Submodule path 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest': checked out 'e2239ee6043f73722e7aa812a459f54a28552929' 2024-11-01T16:48:16.5948056Z Submodule path 'third_party/opentelemetry-cpp/tools/vcpkg': checked out '8eb57355a4ffb410a2e94c07b4dca2dffbee8e50' 2024-11-01T16:48:16.6297875Z Submodule path 'third_party/pocketfft': checked out '9d3ab05a7fffbc71a492bc6a17be034e83e8f0fe' 2024-11-01T16:48:16.9182668Z Submodule path 'third_party/protobuf': checked out 'd1eca4e4b421cd2997495c4b4e65cea6be4e9b8a' 2024-11-01T16:48:16.9275652Z Submodule 'third_party/benchmark' (https://github.com/google/benchmark.git) registered for path 'third_party/protobuf/third_party/benchmark' 2024-11-01T16:48:16.9278803Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/protobuf/third_party/googletest' 2024-11-01T16:48:16.9334452Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/benchmark'... 2024-11-01T16:48:17.8508469Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/protobuf/third_party/googletest'... 
2024-11-01T16:48:19.1208589Z Submodule path 'third_party/protobuf/third_party/benchmark': checked out '5b7683f49e1e9223cf9927b24f6fd3d6bd82e3f8' 2024-11-01T16:48:19.2004572Z Submodule path 'third_party/protobuf/third_party/googletest': checked out '5ec7f0c4a113e2f18ac2c6cc7df51ad6afc24081' 2024-11-01T16:48:19.2254735Z Submodule path 'third_party/psimd': checked out '072586a71b55b7f8c584153d223e95687148a900' 2024-11-01T16:48:19.2550357Z Submodule path 'third_party/pthreadpool': checked out '4fe0e1e183925bf8cfa6aae24237e724a96479b8' 2024-11-01T16:48:19.3038903Z Submodule path 'third_party/pybind11': checked out 'a2e59f0e7065404b44dfe92a28aca47ba1378dc4' 2024-11-01T16:48:19.3511240Z Submodule path 'third_party/python-peachpy': checked out 'f45429b087dd7d5bc78bb40dc7cf06425c252d67' 2024-11-01T16:48:19.4068982Z Submodule path 'third_party/sleef': checked out '60e76d2bce17d278b439d9da17177c8f957a9e9b' 2024-11-01T16:48:19.4474158Z Submodule path 'third_party/tensorpipe': checked out '52791a2fd214b2a9dc5759d36725909c1daa7f2e' 2024-11-01T16:48:19.4537548Z Submodule 'third_party/googletest' (https://github.com/google/googletest.git) registered for path 'third_party/tensorpipe/third_party/googletest' 2024-11-01T16:48:19.4540661Z Submodule 'third_party/libnop' (https://github.com/google/libnop.git) registered for path 'third_party/tensorpipe/third_party/libnop' 2024-11-01T16:48:19.4543482Z Submodule 'third_party/libuv' (https://github.com/libuv/libuv.git) registered for path 'third_party/tensorpipe/third_party/libuv' 2024-11-01T16:48:19.4547214Z Submodule 'third_party/pybind11' (https://github.com/pybind/pybind11.git) registered for path 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T16:48:19.4589209Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/googletest'... 2024-11-01T16:48:20.7209933Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libnop'... 
2024-11-01T16:48:21.3598028Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/libuv'... 2024-11-01T16:48:23.0291374Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11'... 2024-11-01T16:48:24.3643010Z Submodule path 'third_party/tensorpipe/third_party/googletest': checked out 'aee0f9d9b5b87796ee8a0ab26b7587ec30e8858e' 2024-11-01T16:48:24.3955650Z Submodule path 'third_party/tensorpipe/third_party/libnop': checked out '910b55815be16109f04f4180e9adee14fb4ce281' 2024-11-01T16:48:24.4746953Z Submodule path 'third_party/tensorpipe/third_party/libuv': checked out '1dff88e5161cba5c59276d2070d2e304e4dcb242' 2024-11-01T16:48:24.5170572Z Submodule path 'third_party/tensorpipe/third_party/pybind11': checked out 'a23996fce38ff6ccfbcdc09f1e63f2c4be5ea2ef' 2024-11-01T16:48:24.5233243Z Submodule 'tools/clang' (https://github.com/wjakob/clang-cindex-python3) registered for path 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T16:48:24.5278257Z Cloning into '/home/pytorchci/actions-runner/_work/pytorch/pytorch/third_party/tensorpipe/third_party/pybind11/tools/clang'... 
2024-11-01T16:48:25.1027808Z Submodule path 'third_party/tensorpipe/third_party/pybind11/tools/clang': checked out '6a00cbc4a9b8e68b71caf7f774b3f9c753ae84d5' 2024-11-01T16:48:25.1136331Z [command]/usr/bin/git submodule foreach --recursive git config --local gc.auto 0 2024-11-01T16:48:25.1399439Z Entering 'android/libs/fbjni' 2024-11-01T16:48:25.1442735Z Entering 'third_party/FP16' 2024-11-01T16:48:25.1481693Z Entering 'third_party/FXdiv' 2024-11-01T16:48:25.1516691Z Entering 'third_party/NNPACK' 2024-11-01T16:48:25.1556770Z Entering 'third_party/NVTX' 2024-11-01T16:48:25.1599252Z Entering 'third_party/VulkanMemoryAllocator' 2024-11-01T16:48:25.1634377Z Entering 'third_party/XNNPACK' 2024-11-01T16:48:25.1682556Z Entering 'third_party/benchmark' 2024-11-01T16:48:25.1717507Z Entering 'third_party/composable_kernel' 2024-11-01T16:48:25.1761749Z Entering 'third_party/cpp-httplib' 2024-11-01T16:48:25.1798213Z Entering 'third_party/cpuinfo' 2024-11-01T16:48:25.1833537Z Entering 'third_party/cudnn_frontend' 2024-11-01T16:48:25.1868809Z Entering 'third_party/cutlass' 2024-11-01T16:48:25.1909929Z Entering 'third_party/eigen' 2024-11-01T16:48:25.1947141Z Entering 'third_party/fbgemm' 2024-11-01T16:48:25.1990366Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T16:48:25.2026462Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T16:48:25.2059879Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T16:48:25.2105124Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T16:48:25.2137699Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T16:48:25.2184956Z Entering 'third_party/flatbuffers' 2024-11-01T16:48:25.2225176Z Entering 'third_party/fmt' 2024-11-01T16:48:25.2265534Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T16:48:25.2307028Z Entering 'third_party/gloo' 2024-11-01T16:48:25.2349577Z Entering 'third_party/googletest' 2024-11-01T16:48:25.2385663Z Entering 'third_party/ideep' 2024-11-01T16:48:25.2427895Z Entering 
'third_party/ideep/mkl-dnn' 2024-11-01T16:48:25.2477062Z Entering 'third_party/ittapi' 2024-11-01T16:48:25.2519530Z Entering 'third_party/kineto' 2024-11-01T16:48:25.2561777Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T16:48:25.2601065Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T16:48:25.2636517Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T16:48:25.2673757Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T16:48:25.2711135Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T16:48:25.2745319Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T16:48:25.2783240Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T16:48:25.2825869Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T16:48:25.2878780Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T16:48:25.2917443Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T16:48:25.2958730Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T16:48:25.2999423Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T16:48:25.3044423Z Entering 'third_party/mimalloc' 2024-11-01T16:48:25.3082013Z Entering 'third_party/nccl/nccl' 2024-11-01T16:48:25.3118515Z Entering 'third_party/nlohmann' 2024-11-01T16:48:25.3155874Z Entering 'third_party/onnx' 2024-11-01T16:48:25.3210620Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T16:48:25.3264017Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T16:48:25.3309348Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T16:48:25.3350099Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T16:48:25.3385489Z Entering 
'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T16:48:25.3429324Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T16:48:25.3464104Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T16:48:25.3496888Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T16:48:25.3537493Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T16:48:25.3583476Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T16:48:25.3627647Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T16:48:25.3670939Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T16:48:25.3722549Z Entering 'third_party/pocketfft' 2024-11-01T16:48:25.3762183Z Entering 'third_party/protobuf' 2024-11-01T16:48:25.3803165Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T16:48:25.3840391Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T16:48:25.3881070Z Entering 'third_party/psimd' 2024-11-01T16:48:25.3921382Z Entering 'third_party/pthreadpool' 2024-11-01T16:48:25.3958509Z Entering 'third_party/pybind11' 2024-11-01T16:48:25.3998299Z Entering 'third_party/python-peachpy' 2024-11-01T16:48:25.4040786Z Entering 'third_party/sleef' 2024-11-01T16:48:25.4080515Z Entering 'third_party/tensorpipe' 2024-11-01T16:48:25.4115594Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T16:48:25.4157777Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T16:48:25.4202308Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T16:48:25.4239762Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T16:48:25.4272024Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T16:48:25.4327002Z ##[endgroup] 2024-11-01T16:48:25.4331327Z ##[group]Persisting credentials for submodules 2024-11-01T16:48:25.4334227Z 
[command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'url\.https\:\/\/github\.com\/\.insteadOf' && git config --local --unset-all 'url.https://github.com/.insteadOf' || :" 2024-11-01T16:48:25.4588422Z Entering 'android/libs/fbjni' 2024-11-01T16:48:25.4638772Z Entering 'third_party/FP16' 2024-11-01T16:48:25.4688167Z Entering 'third_party/FXdiv' 2024-11-01T16:48:25.4730342Z Entering 'third_party/NNPACK' 2024-11-01T16:48:25.4768703Z Entering 'third_party/NVTX' 2024-11-01T16:48:25.4809904Z Entering 'third_party/VulkanMemoryAllocator' 2024-11-01T16:48:25.4852126Z Entering 'third_party/XNNPACK' 2024-11-01T16:48:25.4907100Z Entering 'third_party/benchmark' 2024-11-01T16:48:25.4951172Z Entering 'third_party/composable_kernel' 2024-11-01T16:48:25.5001314Z Entering 'third_party/cpp-httplib' 2024-11-01T16:48:25.5042032Z Entering 'third_party/cpuinfo' 2024-11-01T16:48:25.5081992Z Entering 'third_party/cudnn_frontend' 2024-11-01T16:48:25.5123264Z Entering 'third_party/cutlass' 2024-11-01T16:48:25.5171565Z Entering 'third_party/eigen' 2024-11-01T16:48:25.5217644Z Entering 'third_party/fbgemm' 2024-11-01T16:48:25.5264678Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T16:48:25.5306072Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T16:48:25.5350428Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T16:48:25.5395097Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T16:48:25.5441549Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T16:48:25.5486846Z Entering 'third_party/flatbuffers' 2024-11-01T16:48:25.5543363Z Entering 'third_party/fmt' 2024-11-01T16:48:25.5597152Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T16:48:25.5643666Z Entering 'third_party/gloo' 2024-11-01T16:48:25.5683083Z Entering 'third_party/googletest' 2024-11-01T16:48:25.5722234Z Entering 'third_party/ideep' 2024-11-01T16:48:25.5762133Z Entering 'third_party/ideep/mkl-dnn' 
2024-11-01T16:48:25.5813842Z Entering 'third_party/ittapi' 2024-11-01T16:48:25.5866392Z Entering 'third_party/kineto' 2024-11-01T16:48:25.5913592Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T16:48:25.5958246Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T16:48:25.6004557Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T16:48:25.6042535Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T16:48:25.6082065Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T16:48:25.6122735Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T16:48:25.6166922Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T16:48:25.6208296Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T16:48:25.6248915Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T16:48:25.6292926Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T16:48:25.6335133Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T16:48:25.6375577Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T16:48:25.6422857Z Entering 'third_party/mimalloc' 2024-11-01T16:48:25.6466735Z Entering 'third_party/nccl/nccl' 2024-11-01T16:48:25.6508621Z Entering 'third_party/nlohmann' 2024-11-01T16:48:25.6549060Z Entering 'third_party/onnx' 2024-11-01T16:48:25.6605826Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T16:48:25.6654899Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T16:48:25.6700102Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T16:48:25.6745068Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T16:48:25.6784570Z Entering 
'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T16:48:25.6821007Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T16:48:25.6866829Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T16:48:25.6907599Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T16:48:25.6946567Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T16:48:25.6987771Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T16:48:25.7033736Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T16:48:25.7074680Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T16:48:25.7135521Z Entering 'third_party/pocketfft' 2024-11-01T16:48:25.7179319Z Entering 'third_party/protobuf' 2024-11-01T16:48:25.7222137Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T16:48:25.7265310Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T16:48:25.7309789Z Entering 'third_party/psimd' 2024-11-01T16:48:25.7353980Z Entering 'third_party/pthreadpool' 2024-11-01T16:48:25.7396136Z Entering 'third_party/pybind11' 2024-11-01T16:48:25.7440331Z Entering 'third_party/python-peachpy' 2024-11-01T16:48:25.7482070Z Entering 'third_party/sleef' 2024-11-01T16:48:25.7523966Z Entering 'third_party/tensorpipe' 2024-11-01T16:48:25.7561131Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T16:48:25.7603467Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T16:48:25.7643153Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T16:48:25.7681832Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T16:48:25.7719757Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T16:48:25.7789422Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local 'http.https://github.com/.extraheader' 
'AUTHORIZATION: basic ***' && git config --local --show-origin --name-only --get-regexp remote.origin.url" 2024-11-01T16:48:25.8051381Z Entering 'android/libs/fbjni' 2024-11-01T16:48:25.8093206Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/android/libs/fbjni/config remote.origin.url 2024-11-01T16:48:25.8111965Z Entering 'third_party/FP16' 2024-11-01T16:48:25.8148827Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FP16/config remote.origin.url 2024-11-01T16:48:25.8166725Z Entering 'third_party/FXdiv' 2024-11-01T16:48:25.8201240Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/FXdiv/config remote.origin.url 2024-11-01T16:48:25.8218846Z Entering 'third_party/NNPACK' 2024-11-01T16:48:25.8252854Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK/config remote.origin.url 2024-11-01T16:48:25.8270748Z Entering 'third_party/NVTX' 2024-11-01T16:48:25.8305637Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NVTX/config remote.origin.url 2024-11-01T16:48:25.8323648Z Entering 'third_party/VulkanMemoryAllocator' 2024-11-01T16:48:25.8359002Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/VulkanMemoryAllocator/config remote.origin.url 2024-11-01T16:48:25.8376871Z Entering 'third_party/XNNPACK' 2024-11-01T16:48:25.8415928Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/XNNPACK/config remote.origin.url 2024-11-01T16:48:25.8447193Z Entering 'third_party/benchmark' 2024-11-01T16:48:25.8493479Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/benchmark/config remote.origin.url 2024-11-01T16:48:25.8512386Z Entering 'third_party/composable_kernel' 2024-11-01T16:48:25.8546390Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/composable_kernel/config 
remote.origin.url 2024-11-01T16:48:25.8568589Z Entering 'third_party/cpp-httplib' 2024-11-01T16:48:25.8607111Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpp-httplib/config remote.origin.url 2024-11-01T16:48:25.8628648Z Entering 'third_party/cpuinfo' 2024-11-01T16:48:25.8665964Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cpuinfo/config remote.origin.url 2024-11-01T16:48:25.8684539Z Entering 'third_party/cudnn_frontend' 2024-11-01T16:48:25.8718708Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cudnn_frontend/config remote.origin.url 2024-11-01T16:48:25.8736409Z Entering 'third_party/cutlass' 2024-11-01T16:48:25.8777037Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/cutlass/config remote.origin.url 2024-11-01T16:48:25.8804275Z Entering 'third_party/eigen' 2024-11-01T16:48:25.8840744Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/eigen/config remote.origin.url 2024-11-01T16:48:25.8864183Z Entering 'third_party/fbgemm' 2024-11-01T16:48:25.8905677Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/config remote.origin.url 2024-11-01T16:48:25.8923429Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T16:48:25.8963039Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/asmjit/config remote.origin.url 2024-11-01T16:48:25.8982494Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T16:48:25.9022393Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cpuinfo/config remote.origin.url 2024-11-01T16:48:25.9043017Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T16:48:25.9079160Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/cutlass/config 
remote.origin.url 2024-11-01T16:48:25.9103999Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T16:48:25.9144305Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/googletest/config remote.origin.url 2024-11-01T16:48:25.9163202Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T16:48:25.9198389Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fbgemm/modules/third_party/hipify_torch/config remote.origin.url 2024-11-01T16:48:25.9221633Z Entering 'third_party/flatbuffers' 2024-11-01T16:48:25.9255297Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/flatbuffers/config remote.origin.url 2024-11-01T16:48:25.9282662Z Entering 'third_party/fmt' 2024-11-01T16:48:25.9325685Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/fmt/config remote.origin.url 2024-11-01T16:48:25.9346994Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T16:48:25.9385456Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gemmlowp/gemmlowp/config remote.origin.url 2024-11-01T16:48:25.9404002Z Entering 'third_party/gloo' 2024-11-01T16:48:25.9440524Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/gloo/config remote.origin.url 2024-11-01T16:48:25.9458422Z Entering 'third_party/googletest' 2024-11-01T16:48:25.9497185Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/googletest/config remote.origin.url 2024-11-01T16:48:25.9516735Z Entering 'third_party/ideep' 2024-11-01T16:48:25.9556341Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/config remote.origin.url 2024-11-01T16:48:25.9573114Z Entering 'third_party/ideep/mkl-dnn' 2024-11-01T16:48:25.9615317Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ideep/modules/mkl-dnn/config 
remote.origin.url 2024-11-01T16:48:25.9645242Z Entering 'third_party/ittapi' 2024-11-01T16:48:25.9680453Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/ittapi/config remote.origin.url 2024-11-01T16:48:25.9698243Z Entering 'third_party/kineto' 2024-11-01T16:48:25.9735624Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/config remote.origin.url 2024-11-01T16:48:25.9756892Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T16:48:25.9797691Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/config remote.origin.url 2024-11-01T16:48:25.9815414Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T16:48:25.9855952Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/DCGM/config remote.origin.url 2024-11-01T16:48:25.9875307Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T16:48:25.9916768Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/cpr/config remote.origin.url 2024-11-01T16:48:25.9933735Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T16:48:25.9972267Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/fmt/config remote.origin.url 2024-11-01T16:48:25.9990373Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T16:48:26.0023999Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/config remote.origin.url 2024-11-01T16:48:26.0040232Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T16:48:26.0076365Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/gflags/modules/doc/config remote.origin.url 2024-11-01T16:48:26.0101583Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T16:48:26.0147297Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/glog/config remote.origin.url 2024-11-01T16:48:26.0165044Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T16:48:26.0203945Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/googletest/config remote.origin.url 2024-11-01T16:48:26.0226139Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T16:48:26.0263134Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/json/config remote.origin.url 2024-11-01T16:48:26.0282967Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T16:48:26.0321679Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/dynolog/modules/third_party/pfs/config remote.origin.url 2024-11-01T16:48:26.0343656Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T16:48:26.0384330Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/fmt/config remote.origin.url 2024-11-01T16:48:26.0401676Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T16:48:26.0436044Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/kineto/modules/libkineto/third_party/googletest/config remote.origin.url 2024-11-01T16:48:26.0456729Z Entering 'third_party/mimalloc' 2024-11-01T16:48:26.0507075Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/mimalloc/config remote.origin.url 2024-11-01T16:48:26.0524009Z Entering 'third_party/nccl/nccl' 2024-11-01T16:48:26.0558911Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nccl/nccl/config remote.origin.url 2024-11-01T16:48:26.0576635Z Entering 'third_party/nlohmann' 2024-11-01T16:48:26.0613757Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/nlohmann/config remote.origin.url 2024-11-01T16:48:26.0631750Z Entering 'third_party/onnx' 2024-11-01T16:48:26.0667131Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/config remote.origin.url 2024-11-01T16:48:26.0697842Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T16:48:26.0735069Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/onnx/modules/third_party/pybind11/config remote.origin.url 2024-11-01T16:48:26.0758815Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T16:48:26.0797408Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/config remote.origin.url 2024-11-01T16:48:26.0820270Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T16:48:26.0858039Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/benchmark/config remote.origin.url 2024-11-01T16:48:26.0878806Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T16:48:26.0924382Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/googletest/config 
remote.origin.url 2024-11-01T16:48:26.0941742Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T16:48:26.0982346Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/ms-gsl/config remote.origin.url 2024-11-01T16:48:26.1000107Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T16:48:26.1034248Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/nlohmann-json/config remote.origin.url 2024-11-01T16:48:26.1051957Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T16:48:26.1095022Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentelemetry-proto/config remote.origin.url 2024-11-01T16:48:26.1117491Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T16:48:26.1154054Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/opentracing-cpp/config remote.origin.url 2024-11-01T16:48:26.1170396Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T16:48:26.1202806Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/config remote.origin.url 2024-11-01T16:48:26.1218982Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T16:48:26.1264043Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/civetweb/config remote.origin.url 2024-11-01T16:48:26.1283702Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T16:48:26.1319508Z 
file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/third_party/prometheus-cpp/modules/googletest/config remote.origin.url 2024-11-01T16:48:26.1342465Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T16:48:26.1384332Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/opentelemetry-cpp/modules/tools/vcpkg/config remote.origin.url 2024-11-01T16:48:26.1420773Z Entering 'third_party/pocketfft' 2024-11-01T16:48:26.1455345Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pocketfft/config remote.origin.url 2024-11-01T16:48:26.1473816Z Entering 'third_party/protobuf' 2024-11-01T16:48:26.1514117Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/config remote.origin.url 2024-11-01T16:48:26.1537343Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T16:48:26.1577522Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/benchmark/config remote.origin.url 2024-11-01T16:48:26.1596168Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T16:48:26.1637165Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/protobuf/modules/third_party/googletest/config remote.origin.url 2024-11-01T16:48:26.1657897Z Entering 'third_party/psimd' 2024-11-01T16:48:26.1693775Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/psimd/config remote.origin.url 2024-11-01T16:48:26.1711347Z Entering 'third_party/pthreadpool' 2024-11-01T16:48:26.1744641Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/NNPACK_deps/pthreadpool/config remote.origin.url 2024-11-01T16:48:26.1762819Z Entering 'third_party/pybind11' 2024-11-01T16:48:26.1797773Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/pybind11/config 
remote.origin.url 2024-11-01T16:48:26.1815554Z Entering 'third_party/python-peachpy' 2024-11-01T16:48:26.1855307Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/python-peachpy/config remote.origin.url 2024-11-01T16:48:26.1872382Z Entering 'third_party/sleef' 2024-11-01T16:48:26.1910540Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/sleef/config remote.origin.url 2024-11-01T16:48:26.1928585Z Entering 'third_party/tensorpipe' 2024-11-01T16:48:26.1972686Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/config remote.origin.url 2024-11-01T16:48:26.1989520Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T16:48:26.2027479Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/googletest/config remote.origin.url 2024-11-01T16:48:26.2045444Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T16:48:26.2081316Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libnop/config remote.origin.url 2024-11-01T16:48:26.2097985Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T16:48:26.2136417Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/libuv/config remote.origin.url 2024-11-01T16:48:26.2157591Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T16:48:26.2205381Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/config remote.origin.url 2024-11-01T16:48:26.2227155Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T16:48:26.2265355Z file:/home/pytorchci/actions-runner/_work/pytorch/pytorch/.git/modules/third_party/tensorpipe/modules/third_party/pybind11/modules/tools/clang/config remote.origin.url 
2024-11-01T16:48:26.2632534Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'git@github.com:' 2024-11-01T16:48:26.2898916Z Entering 'android/libs/fbjni' 2024-11-01T16:48:26.2939823Z Entering 'third_party/FP16' 2024-11-01T16:48:26.2977929Z Entering 'third_party/FXdiv' 2024-11-01T16:48:26.3018458Z Entering 'third_party/NNPACK' 2024-11-01T16:48:26.3058254Z Entering 'third_party/NVTX' 2024-11-01T16:48:26.3097216Z Entering 'third_party/VulkanMemoryAllocator' 2024-11-01T16:48:26.3138581Z Entering 'third_party/XNNPACK' 2024-11-01T16:48:26.3190417Z Entering 'third_party/benchmark' 2024-11-01T16:48:26.3229404Z Entering 'third_party/composable_kernel' 2024-11-01T16:48:26.3269828Z Entering 'third_party/cpp-httplib' 2024-11-01T16:48:26.3309107Z Entering 'third_party/cpuinfo' 2024-11-01T16:48:26.3345822Z Entering 'third_party/cudnn_frontend' 2024-11-01T16:48:26.3383427Z Entering 'third_party/cutlass' 2024-11-01T16:48:26.3432902Z Entering 'third_party/eigen' 2024-11-01T16:48:26.3475787Z Entering 'third_party/fbgemm' 2024-11-01T16:48:26.3512265Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T16:48:26.3547098Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T16:48:26.3585525Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T16:48:26.3627061Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T16:48:26.3666464Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T16:48:26.3710371Z Entering 'third_party/flatbuffers' 2024-11-01T16:48:26.3752210Z Entering 'third_party/fmt' 2024-11-01T16:48:26.3795614Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T16:48:26.3836819Z Entering 'third_party/gloo' 2024-11-01T16:48:26.3877698Z Entering 'third_party/googletest' 2024-11-01T16:48:26.3918094Z Entering 'third_party/ideep' 2024-11-01T16:48:26.3963353Z Entering 'third_party/ideep/mkl-dnn' 2024-11-01T16:48:26.4012858Z Entering 'third_party/ittapi' 
2024-11-01T16:48:26.4054654Z Entering 'third_party/kineto' 2024-11-01T16:48:26.4095717Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T16:48:26.4139141Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T16:48:26.4181902Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T16:48:26.4215680Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T16:48:26.4258975Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T16:48:26.4296329Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T16:48:26.4339186Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T16:48:26.4378366Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T16:48:26.4419467Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T16:48:26.4458742Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T16:48:26.4502747Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T16:48:26.4549673Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T16:48:26.4587372Z Entering 'third_party/mimalloc' 2024-11-01T16:48:26.4629457Z Entering 'third_party/nccl/nccl' 2024-11-01T16:48:26.4677470Z Entering 'third_party/nlohmann' 2024-11-01T16:48:26.4722067Z Entering 'third_party/onnx' 2024-11-01T16:48:26.4772754Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T16:48:26.4822764Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T16:48:26.4868998Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T16:48:26.4908193Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T16:48:26.4947354Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T16:48:26.4985990Z Entering 
'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T16:48:26.5027919Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T16:48:26.5066948Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T16:48:26.5106539Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T16:48:26.5145776Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T16:48:26.5189611Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T16:48:26.5228048Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T16:48:26.5288780Z Entering 'third_party/pocketfft' 2024-11-01T16:48:26.5335014Z Entering 'third_party/protobuf' 2024-11-01T16:48:26.5378959Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T16:48:26.5417140Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T16:48:26.5454101Z Entering 'third_party/psimd' 2024-11-01T16:48:26.5489198Z Entering 'third_party/pthreadpool' 2024-11-01T16:48:26.5524219Z Entering 'third_party/pybind11' 2024-11-01T16:48:26.5562539Z Entering 'third_party/python-peachpy' 2024-11-01T16:48:26.5602693Z Entering 'third_party/sleef' 2024-11-01T16:48:26.5642166Z Entering 'third_party/tensorpipe' 2024-11-01T16:48:26.5681455Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T16:48:26.5721132Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T16:48:26.5756518Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T16:48:26.5797421Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T16:48:26.5837267Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T16:48:26.5920676Z [command]/usr/bin/git submodule foreach --recursive git config --local --add 'url.https://github.com/.insteadOf' 'org-21003710@github.com:' 2024-11-01T16:48:26.6178707Z Entering 'android/libs/fbjni' 
2024-11-01T16:48:26.6225812Z Entering 'third_party/FP16' 2024-11-01T16:48:26.6267235Z Entering 'third_party/FXdiv' 2024-11-01T16:48:26.6309239Z Entering 'third_party/NNPACK' 2024-11-01T16:48:26.6348996Z Entering 'third_party/NVTX' 2024-11-01T16:48:26.6390524Z Entering 'third_party/VulkanMemoryAllocator' 2024-11-01T16:48:26.6435130Z Entering 'third_party/XNNPACK' 2024-11-01T16:48:26.6486082Z Entering 'third_party/benchmark' 2024-11-01T16:48:26.6538997Z Entering 'third_party/composable_kernel' 2024-11-01T16:48:26.6591041Z Entering 'third_party/cpp-httplib' 2024-11-01T16:48:26.6631314Z Entering 'third_party/cpuinfo' 2024-11-01T16:48:26.6672538Z Entering 'third_party/cudnn_frontend' 2024-11-01T16:48:26.6712111Z Entering 'third_party/cutlass' 2024-11-01T16:48:26.6755615Z Entering 'third_party/eigen' 2024-11-01T16:48:26.6796319Z Entering 'third_party/fbgemm' 2024-11-01T16:48:26.6836214Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T16:48:26.6882325Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T16:48:26.6923274Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T16:48:26.6968837Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T16:48:26.7009346Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T16:48:26.7046638Z Entering 'third_party/flatbuffers' 2024-11-01T16:48:26.7087361Z Entering 'third_party/fmt' 2024-11-01T16:48:26.7128826Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T16:48:26.7164420Z Entering 'third_party/gloo' 2024-11-01T16:48:26.7202583Z Entering 'third_party/googletest' 2024-11-01T16:48:26.7242250Z Entering 'third_party/ideep' 2024-11-01T16:48:26.7277262Z Entering 'third_party/ideep/mkl-dnn' 2024-11-01T16:48:26.7322569Z Entering 'third_party/ittapi' 2024-11-01T16:48:26.7362941Z Entering 'third_party/kineto' 2024-11-01T16:48:26.7401549Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T16:48:26.7441354Z Entering 
'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T16:48:26.7483140Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T16:48:26.7518213Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T16:48:26.7554010Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T16:48:26.7586378Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T16:48:26.7632011Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T16:48:26.7666332Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T16:48:26.7699097Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T16:48:26.7745454Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T16:48:26.7782817Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T16:48:26.7827964Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T16:48:26.7871714Z Entering 'third_party/mimalloc' 2024-11-01T16:48:26.7911244Z Entering 'third_party/nccl/nccl' 2024-11-01T16:48:26.7951304Z Entering 'third_party/nlohmann' 2024-11-01T16:48:26.7991563Z Entering 'third_party/onnx' 2024-11-01T16:48:26.8042519Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T16:48:26.8088428Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T16:48:26.8128856Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T16:48:26.8165349Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T16:48:26.8198994Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T16:48:26.8240246Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T16:48:26.8277107Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T16:48:26.8317394Z 
Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T16:48:26.8362101Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T16:48:26.8401210Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T16:48:26.8448981Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T16:48:26.8491393Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T16:48:26.8553387Z Entering 'third_party/pocketfft' 2024-11-01T16:48:26.8589336Z Entering 'third_party/protobuf' 2024-11-01T16:48:26.8632008Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T16:48:26.8671675Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T16:48:26.8710467Z Entering 'third_party/psimd' 2024-11-01T16:48:26.8750551Z Entering 'third_party/pthreadpool' 2024-11-01T16:48:26.8790787Z Entering 'third_party/pybind11' 2024-11-01T16:48:26.8830671Z Entering 'third_party/python-peachpy' 2024-11-01T16:48:26.8866757Z Entering 'third_party/sleef' 2024-11-01T16:48:26.8908539Z Entering 'third_party/tensorpipe' 2024-11-01T16:48:26.8950028Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T16:48:26.8992397Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T16:48:26.9031804Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T16:48:26.9071784Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T16:48:26.9106479Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T16:48:26.9172490Z ##[endgroup] 2024-11-01T16:48:26.9207426Z [command]/usr/bin/git log -1 --format='%H' 2024-11-01T16:48:26.9235331Z 'd1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea' 2024-11-01T16:48:26.9445922Z Prepare all required actions 2024-11-01T16:48:26.9446457Z Getting action download info 2024-11-01T16:48:26.9509747Z ##[group]Run ./.github/actions/setup-rocm 2024-11-01T16:48:26.9510079Z env: 
2024-11-01T16:48:26.9510291Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:26.9510557Z ##[endgroup] 2024-11-01T16:48:26.9529837Z ##[group]Run echo "DOCKER_HOST=unix:///run/user/$(id -u)/docker.sock" >> "${GITHUB_ENV}" 2024-11-01T16:48:26.9530481Z echo "DOCKER_HOST=unix:///run/user/$(id -u)/docker.sock" >> "${GITHUB_ENV}" 2024-11-01T16:48:26.9546623Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:26.9547006Z env: 2024-11-01T16:48:26.9547219Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:26.9547480Z ##[endgroup] 2024-11-01T16:48:26.9650717Z ##[group]Run set -ex 2024-11-01T16:48:26.9651332Z set -ex 2024-11-01T16:48:26.9651848Z  2024-11-01T16:48:26.9652435Z cat ~/.docker/config.json || true 2024-11-01T16:48:26.9654323Z # https://stackoverflow.com/questions/64455468/error-when-logging-into-ecr-with-docker-login-error-saving-credentials-not 2024-11-01T16:48:26.9656130Z rm -f ~/.docker/config.json 2024-11-01T16:48:26.9692187Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:26.9693095Z env: 2024-11-01T16:48:26.9693583Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:26.9694325Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:26.9695134Z ##[endgroup] 2024-11-01T16:48:26.9764931Z + cat /home/pytorchci/.docker/config.json 2024-11-01T16:48:26.9782783Z { 2024-11-01T16:48:26.9783260Z "auths": {} 2024-11-01T16:48:26.9785886Z + rm -f /home/pytorchci/.docker/config.json 2024-11-01T16:48:26.9805696Z } 2024-11-01T16:48:26.9846612Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2024-11-01T16:48:26.9847980Z # ignore expansion of "docker ps -q" since it could be empty 2024-11-01T16:48:26.9849187Z # shellcheck disable=SC2046 2024-11-01T16:48:26.9850151Z docker stop $(docker ps -q) || true 2024-11-01T16:48:26.9851121Z # Prune all stopped containers. 
2024-11-01T16:48:26.9852513Z docker container prune -f 2024-11-01T16:48:26.9880219Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:26.9880682Z env: 2024-11-01T16:48:26.9880947Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:26.9881336Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:26.9881747Z ##[endgroup] 2024-11-01T16:48:27.0254287Z "docker stop" requires at least 1 argument. 2024-11-01T16:48:27.0255446Z See 'docker stop --help'. 2024-11-01T16:48:27.0255952Z 2024-11-01T16:48:27.0256444Z Usage: docker stop [OPTIONS] CONTAINER [CONTAINER...] 2024-11-01T16:48:27.0257272Z 2024-11-01T16:48:27.0257608Z Stop one or more running containers 2024-11-01T16:48:27.0439887Z Total reclaimed space: 0B 2024-11-01T16:48:27.0495778Z ##[group]Run cat /etc/os-release || true 2024-11-01T16:48:27.0496468Z cat /etc/os-release || true 2024-11-01T16:48:27.0497159Z cat /etc/apt/sources.list.d/rocm.list || true 2024-11-01T16:48:27.0497916Z cat /opt/rocm/.info/version || true 2024-11-01T16:48:27.0498692Z whoami 2024-11-01T16:48:27.0524284Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.0524988Z env: 2024-11-01T16:48:27.0525372Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.0525939Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.0526555Z ##[endgroup] 2024-11-01T16:48:27.0587182Z PRETTY_NAME="Ubuntu 22.04.3 LTS" 2024-11-01T16:48:27.0587845Z NAME="Ubuntu" 2024-11-01T16:48:27.0588449Z VERSION_ID="22.04" 2024-11-01T16:48:27.0588937Z VERSION="22.04.3 LTS (Jammy Jellyfish)" 2024-11-01T16:48:27.0589348Z VERSION_CODENAME=jammy 2024-11-01T16:48:27.0589665Z ID=ubuntu 2024-11-01T16:48:27.0589949Z ID_LIKE=debian 2024-11-01T16:48:27.0592244Z HOME_URL="https://www.ubuntu.com/" 2024-11-01T16:48:27.0592727Z SUPPORT_URL="https://help.ubuntu.com/" 2024-11-01T16:48:27.0595319Z BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" 2024-11-01T16:48:27.0596157Z 
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" 2024-11-01T16:48:27.0596758Z UBUNTU_CODENAME=jammy 2024-11-01T16:48:27.0597127Z deb [arch=amd64] https://repo.radeon.com/rocm/apt/6.2.1 jammy main 2024-11-01T16:48:27.0607063Z 6.2.1-112 2024-11-01T16:48:27.0620779Z pytorchci 2024-11-01T16:48:27.0642893Z ##[group]Run rocm-smi 2024-11-01T16:48:27.0643153Z rocm-smi 2024-11-01T16:48:27.0656557Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.0656945Z env: 2024-11-01T16:48:27.0657172Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.0657489Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.0657830Z ##[endgroup] 2024-11-01T16:48:27.1461405Z 2024-11-01T16:48:27.1461619Z 2024-11-01T16:48:27.1462286Z ========================================= ROCm System Management Interface ========================================= 2024-11-01T16:48:27.1463487Z =================================================== Concise Info =================================================== 2024-11-01T16:48:27.1464812Z Device Node IDs Temp Power Partitions SCLK MCLK Fan Perf PwrCap VRAM% GPU% 2024-11-01T16:48:27.1466454Z  (DID, GUID) (Edge) (Avg) (Mem, Compute, ID)  2024-11-01T16:48:27.1467524Z ==================================================================================================================== 2024-11-01T16:48:27.1468560Z 0 2 0x740f, 12261 45.0°C 38.0W N/A, N/A, 0 800Mhz 1600Mhz 0% auto 300.0W 0% 0% 2024-11-01T16:48:27.1469345Z 1 3 0x740f, 36740 42.0°C 45.0W N/A, N/A, 0 800Mhz 1600Mhz 0% auto 300.0W 0% 0% 2024-11-01T16:48:27.1469922Z ==================================================================================================================== 2024-11-01T16:48:27.1470443Z =============================================== End of ROCm SMI Log ================================================ 2024-11-01T16:48:27.1613745Z ##[group]Run rocminfo 2024-11-01T16:48:27.1614374Z rocminfo 2024-11-01T16:48:27.1648648Z shell: 
/usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.1649557Z env: 2024-11-01T16:48:27.1650053Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.1650782Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.1651567Z ##[endgroup] 2024-11-01T16:48:27.2394710Z ROCk module version 6.8.5 is loaded 2024-11-01T16:48:27.2395183Z ===================== 2024-11-01T16:48:27.2395543Z HSA System Attributes 2024-11-01T16:48:27.2395874Z ===================== 2024-11-01T16:48:27.2396196Z Runtime Version: 1.14 2024-11-01T16:48:27.2396555Z Runtime Ext Version: 1.6 2024-11-01T16:48:27.2396929Z System Timestamp Freq.: 1000.000000MHz 2024-11-01T16:48:27.2397681Z Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2024-11-01T16:48:27.2398467Z Machine Model: LARGE 2024-11-01T16:48:27.2399138Z System Endianness: LITTLE 2024-11-01T16:48:27.2399598Z Mwaitx: DISABLED 2024-11-01T16:48:27.2400009Z DMAbuf Support: YES 2024-11-01T16:48:27.2400256Z 2024-11-01T16:48:27.2400397Z ========== 2024-11-01T16:48:27.2400723Z HSA Agents 2024-11-01T16:48:27.2401059Z ========== 2024-11-01T16:48:27.2401367Z ******* 2024-11-01T16:48:27.2401665Z Agent 1 2024-11-01T16:48:27.2401963Z ******* 2024-11-01T16:48:27.2402431Z Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:48:27.2402999Z Uuid: CPU-XX 2024-11-01T16:48:27.2403581Z Marketing Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:48:27.2404466Z Vendor Name: CPU 2024-11-01T16:48:27.2405010Z Feature: None specified 2024-11-01T16:48:27.2405527Z Profile: FULL_PROFILE 2024-11-01T16:48:27.2406065Z Float Round Mode: NEAR 2024-11-01T16:48:27.2406609Z Max Queue Number: 0(0x0) 2024-11-01T16:48:27.2407134Z Queue Min Size: 0(0x0) 2024-11-01T16:48:27.2407655Z Queue Max Size: 0(0x0) 2024-11-01T16:48:27.2408157Z Queue Type: MULTI 2024-11-01T16:48:27.2408625Z Node: 0 2024-11-01T16:48:27.2409098Z Device Type: CPU 2024-11-01T16:48:27.2409538Z Cache Info: 2024-11-01T16:48:27.2409954Z L1: 32768(0x8000) KB 
2024-11-01T16:48:27.2410465Z Chip ID: 0(0x0) 2024-11-01T16:48:27.2410967Z ASIC Revision: 0(0x0) 2024-11-01T16:48:27.2411453Z Cacheline Size: 64(0x40) 2024-11-01T16:48:27.2411988Z Max Clock Freq. (MHz): 2600 2024-11-01T16:48:27.2412504Z BDFID: 0 2024-11-01T16:48:27.2412996Z Internal Node ID: 0 2024-11-01T16:48:27.2413489Z Compute Unit: 32 2024-11-01T16:48:27.2413990Z SIMDs per CU: 0 2024-11-01T16:48:27.2414492Z Shader Engines: 0 2024-11-01T16:48:27.2415007Z Shader Arrs. per Eng.: 0 2024-11-01T16:48:27.2415604Z WatchPts on Addr. Ranges:1 2024-11-01T16:48:27.2416072Z Memory Properties: 2024-11-01T16:48:27.2416408Z Features: None 2024-11-01T16:48:27.2416975Z Pool Info: 2024-11-01T16:48:27.2417312Z Pool 1 2024-11-01T16:48:27.2417818Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-11-01T16:48:27.2418537Z Size: 65790800(0x3ebe350) KB 2024-11-01T16:48:27.2419053Z Allocatable: TRUE 2024-11-01T16:48:27.2419592Z Alloc Granule: 4KB 2024-11-01T16:48:27.2420159Z Alloc Recommended Granule:4KB 2024-11-01T16:48:27.2420718Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2421248Z Accessible by all: TRUE 2024-11-01T16:48:27.2421702Z Pool 2 2024-11-01T16:48:27.2422146Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-11-01T16:48:27.2422691Z Size: 65790800(0x3ebe350) KB 2024-11-01T16:48:27.2423230Z Allocatable: TRUE 2024-11-01T16:48:27.2423770Z Alloc Granule: 4KB 2024-11-01T16:48:27.2424314Z Alloc Recommended Granule:4KB 2024-11-01T16:48:27.2424877Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2425421Z Accessible by all: TRUE 2024-11-01T16:48:27.2425880Z Pool 3 2024-11-01T16:48:27.2426306Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:48:27.2426812Z Size: 65790800(0x3ebe350) KB 2024-11-01T16:48:27.2427309Z Allocatable: TRUE 2024-11-01T16:48:27.2428004Z Alloc Granule: 4KB 2024-11-01T16:48:27.2428562Z Alloc Recommended Granule:4KB 2024-11-01T16:48:27.2429121Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2429660Z Accessible by all: TRUE 2024-11-01T16:48:27.2430110Z ISA Info: 
2024-11-01T16:48:27.2430431Z ******* 2024-11-01T16:48:27.2430743Z Agent 2 2024-11-01T16:48:27.2431042Z ******* 2024-11-01T16:48:27.2431491Z Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:48:27.2432040Z Uuid: CPU-XX 2024-11-01T16:48:27.2432617Z Marketing Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:48:27.2433133Z Vendor Name: CPU 2024-11-01T16:48:27.2433634Z Feature: None specified 2024-11-01T16:48:27.2434146Z Profile: FULL_PROFILE 2024-11-01T16:48:27.2434652Z Float Round Mode: NEAR 2024-11-01T16:48:27.2435181Z Max Queue Number: 0(0x0) 2024-11-01T16:48:27.2435693Z Queue Min Size: 0(0x0) 2024-11-01T16:48:27.2436179Z Queue Max Size: 0(0x0) 2024-11-01T16:48:27.2436672Z Queue Type: MULTI 2024-11-01T16:48:27.2437140Z Node: 1 2024-11-01T16:48:27.2437619Z Device Type: CPU 2024-11-01T16:48:27.2438035Z Cache Info: 2024-11-01T16:48:27.2438418Z L1: 32768(0x8000) KB 2024-11-01T16:48:27.2438910Z Chip ID: 0(0x0) 2024-11-01T16:48:27.2439397Z ASIC Revision: 0(0x0) 2024-11-01T16:48:27.2439903Z Cacheline Size: 64(0x40) 2024-11-01T16:48:27.2440570Z Max Clock Freq. (MHz): 2600 2024-11-01T16:48:27.2441053Z BDFID: 0 2024-11-01T16:48:27.2441564Z Internal Node ID: 1 2024-11-01T16:48:27.2442080Z Compute Unit: 32 2024-11-01T16:48:27.2442567Z SIMDs per CU: 0 2024-11-01T16:48:27.2443069Z Shader Engines: 0 2024-11-01T16:48:27.2443594Z Shader Arrs. per Eng.: 0 2024-11-01T16:48:27.2444114Z WatchPts on Addr. 
Ranges:1 2024-11-01T16:48:27.2444644Z Memory Properties: 2024-11-01T16:48:27.2445070Z Features: None 2024-11-01T16:48:27.2445482Z Pool Info: 2024-11-01T16:48:27.2445855Z Pool 1 2024-11-01T16:48:27.2446277Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-11-01T16:48:27.2446807Z Size: 66046476(0x3efca0c) KB 2024-11-01T16:48:27.2447326Z Allocatable: TRUE 2024-11-01T16:48:27.2447852Z Alloc Granule: 4KB 2024-11-01T16:48:27.2448401Z Alloc Recommended Granule:4KB 2024-11-01T16:48:27.2448945Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2449480Z Accessible by all: TRUE 2024-11-01T16:48:27.2449925Z Pool 2 2024-11-01T16:48:27.2450337Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-11-01T16:48:27.2451017Z Size: 66046476(0x3efca0c) KB 2024-11-01T16:48:27.2452205Z Allocatable: TRUE 2024-11-01T16:48:27.2453198Z Alloc Granule: 4KB 2024-11-01T16:48:27.2454244Z Alloc Recommended Granule:4KB 2024-11-01T16:48:27.2455294Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2456288Z Accessible by all: TRUE 2024-11-01T16:48:27.2457130Z Pool 3 2024-11-01T16:48:27.2457901Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:48:27.2459193Z Size: 66046476(0x3efca0c) KB 2024-11-01T16:48:27.2460338Z Allocatable: TRUE 2024-11-01T16:48:27.2461435Z Alloc Granule: 4KB 2024-11-01T16:48:27.2462463Z Alloc Recommended Granule:4KB 2024-11-01T16:48:27.2463508Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2464506Z Accessible by all: TRUE 2024-11-01T16:48:27.2465349Z ISA Info: 2024-11-01T16:48:27.2465926Z ******* 2024-11-01T16:48:27.2466489Z Agent 3 2024-11-01T16:48:27.2467047Z ******* 2024-11-01T16:48:27.2467714Z Name: gfx90a 2024-11-01T16:48:27.2468798Z Uuid: GPU-cc050612e1017d6b 2024-11-01T16:48:27.2469962Z Marketing Name: AMD Instinct MI210 2024-11-01T16:48:27.2471107Z Vendor Name: AMD 2024-11-01T16:48:27.2472242Z Feature: KERNEL_DISPATCH 2024-11-01T16:48:27.2473382Z Profile: BASE_PROFILE 2024-11-01T16:48:27.2474475Z Float Round Mode: NEAR 2024-11-01T16:48:27.2475104Z Max Queue Number: 128(0x80) 
2024-11-01T16:48:27.2475616Z Queue Min Size: 64(0x40) 2024-11-01T16:48:27.2476273Z Queue Max Size: 131072(0x20000) 2024-11-01T16:48:27.2476765Z Queue Type: MULTI 2024-11-01T16:48:27.2477234Z Node: 2 2024-11-01T16:48:27.2477702Z Device Type: GPU 2024-11-01T16:48:27.2478124Z Cache Info: 2024-11-01T16:48:27.2478511Z L1: 16(0x10) KB 2024-11-01T16:48:27.2478984Z L2: 8192(0x2000) KB 2024-11-01T16:48:27.2479458Z Chip ID: 29711(0x740f) 2024-11-01T16:48:27.2479954Z ASIC Revision: 1(0x1) 2024-11-01T16:48:27.2480512Z Cacheline Size: 64(0x40) 2024-11-01T16:48:27.2481435Z Max Clock Freq. (MHz): 1700 2024-11-01T16:48:27.2482500Z BDFID: 768 2024-11-01T16:48:27.2483418Z Internal Node ID: 2 2024-11-01T16:48:27.2484354Z Compute Unit: 104 2024-11-01T16:48:27.2485281Z SIMDs per CU: 4 2024-11-01T16:48:27.2486214Z Shader Engines: 8 2024-11-01T16:48:27.2487169Z Shader Arrs. per Eng.: 1 2024-11-01T16:48:27.2488182Z WatchPts on Addr. Ranges:4 2024-11-01T16:48:27.2489374Z Coherent Host Access: FALSE 2024-11-01T16:48:27.2490382Z Memory Properties: 2024-11-01T16:48:27.2491168Z Features: KERNEL_DISPATCH 2024-11-01T16:48:27.2492378Z Fast F16 Operation: TRUE 2024-11-01T16:48:27.2493363Z Wavefront Size: 64(0x40) 2024-11-01T16:48:27.2494337Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:48:27.2495213Z Workgroup Max Size per Dimension: 2024-11-01T16:48:27.2496012Z x 1024(0x400) 2024-11-01T16:48:27.2496859Z y 1024(0x400) 2024-11-01T16:48:27.2497697Z z 1024(0x400) 2024-11-01T16:48:27.2498762Z Max Waves Per CU: 32(0x20) 2024-11-01T16:48:27.2499892Z Max Work-item Per CU: 2048(0x800) 2024-11-01T16:48:27.2500891Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:48:27.2501761Z Grid Max Size per Dimension: 2024-11-01T16:48:27.2502480Z x 4294967295(0xffffffff) 2024-11-01T16:48:27.2503360Z y 4294967295(0xffffffff) 2024-11-01T16:48:27.2504221Z z 4294967295(0xffffffff) 2024-11-01T16:48:27.2505148Z Max fbarriers/Workgrp: 32 2024-11-01T16:48:27.2526075Z Packet Processor uCode:: 83 2024-11-01T16:48:27.2527244Z 
SDMA engine uCode:: 8 2024-11-01T16:48:27.2528290Z IOMMU Support:: None 2024-11-01T16:48:27.2529141Z Pool Info: 2024-11-01T16:48:27.2529770Z Pool 1 2024-11-01T16:48:27.2530589Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:48:27.2531624Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:48:27.2532727Z Allocatable: TRUE 2024-11-01T16:48:27.2533749Z Alloc Granule: 4KB 2024-11-01T16:48:27.2534809Z Alloc Recommended Granule:2048KB 2024-11-01T16:48:27.2536375Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2537406Z Accessible by all: FALSE 2024-11-01T16:48:27.2538565Z Pool 2 2024-11-01T16:48:27.2539549Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-11-01T16:48:27.2540733Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:48:27.2541877Z Allocatable: TRUE 2024-11-01T16:48:27.2543060Z Alloc Granule: 4KB 2024-11-01T16:48:27.2544248Z Alloc Recommended Granule:2048KB 2024-11-01T16:48:27.2545317Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2546424Z Accessible by all: FALSE 2024-11-01T16:48:27.2547451Z Pool 3 2024-11-01T16:48:27.2548355Z Segment: GROUP 2024-11-01T16:48:27.2549367Z Size: 64(0x40) KB 2024-11-01T16:48:27.2550304Z Allocatable: FALSE 2024-11-01T16:48:27.2551283Z Alloc Granule: 0KB 2024-11-01T16:48:27.2552311Z Alloc Recommended Granule:0KB 2024-11-01T16:48:27.2553356Z Alloc Alignment: 0KB 2024-11-01T16:48:27.2554463Z Accessible by all: FALSE 2024-11-01T16:48:27.2555447Z ISA Info: 2024-11-01T16:48:27.2556152Z ISA 1 2024-11-01T16:48:27.2557363Z Name: amdgcn-amd-amdhsa--gfx90a:sramecc+:xnack- 2024-11-01T16:48:27.2559000Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-11-01T16:48:27.2559881Z Profiles: HSA_PROFILE_BASE 2024-11-01T16:48:27.2560477Z Default Rounding Mode: NEAR 2024-11-01T16:48:27.2561020Z Default Rounding Mode: NEAR 2024-11-01T16:48:27.2561536Z Fast f16: TRUE 2024-11-01T16:48:27.2562056Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:48:27.2562530Z Workgroup Max Size per Dimension: 2024-11-01T16:48:27.2562974Z x 1024(0x400) 2024-11-01T16:48:27.2563439Z y 1024(0x400) 
2024-11-01T16:48:27.2563897Z z 1024(0x400) 2024-11-01T16:48:27.2564398Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:48:27.2564871Z Grid Max Size per Dimension: 2024-11-01T16:48:27.2565278Z x 4294967295(0xffffffff) 2024-11-01T16:48:27.2565813Z y 4294967295(0xffffffff) 2024-11-01T16:48:27.2566783Z z 4294967295(0xffffffff) 2024-11-01T16:48:27.2567891Z FBarrier Max Size: 32 2024-11-01T16:48:27.2568841Z ******* 2024-11-01T16:48:27.2569404Z Agent 4 2024-11-01T16:48:27.2569954Z ******* 2024-11-01T16:48:27.2570630Z Name: gfx90a 2024-11-01T16:48:27.2571685Z Uuid: GPU-f7f1beca7bad9e24 2024-11-01T16:48:27.2572664Z Marketing Name: AMD Instinct MI210 2024-11-01T16:48:27.2573653Z Vendor Name: AMD 2024-11-01T16:48:27.2574623Z Feature: KERNEL_DISPATCH 2024-11-01T16:48:27.2575555Z Profile: BASE_PROFILE 2024-11-01T16:48:27.2576823Z Float Round Mode: NEAR 2024-11-01T16:48:27.2577803Z Max Queue Number: 128(0x80) 2024-11-01T16:48:27.2578919Z Queue Min Size: 64(0x40) 2024-11-01T16:48:27.2579851Z Queue Max Size: 131072(0x20000) 2024-11-01T16:48:27.2580811Z Queue Type: MULTI 2024-11-01T16:48:27.2581710Z Node: 3 2024-11-01T16:48:27.2582586Z Device Type: GPU 2024-11-01T16:48:27.2583390Z Cache Info: 2024-11-01T16:48:27.2584100Z L1: 16(0x10) KB 2024-11-01T16:48:27.2584962Z L2: 8192(0x2000) KB 2024-11-01T16:48:27.2585884Z Chip ID: 29711(0x740f) 2024-11-01T16:48:27.2586810Z ASIC Revision: 1(0x1) 2024-11-01T16:48:27.2587766Z Cacheline Size: 64(0x40) 2024-11-01T16:48:27.2588907Z Max Clock Freq. (MHz): 1700 2024-11-01T16:48:27.2589995Z BDFID: 33536 2024-11-01T16:48:27.2591064Z Internal Node ID: 3 2024-11-01T16:48:27.2592176Z Compute Unit: 104 2024-11-01T16:48:27.2593269Z SIMDs per CU: 4 2024-11-01T16:48:27.2593922Z Shader Engines: 8 2024-11-01T16:48:27.2594452Z Shader Arrs. per Eng.: 1 2024-11-01T16:48:27.2594982Z WatchPts on Addr. 
Ranges:4 2024-11-01T16:48:27.2595663Z Coherent Host Access: FALSE 2024-11-01T16:48:27.2596109Z Memory Properties: 2024-11-01T16:48:27.2596486Z Features: KERNEL_DISPATCH 2024-11-01T16:48:27.2596992Z Fast F16 Operation: TRUE 2024-11-01T16:48:27.2597504Z Wavefront Size: 64(0x40) 2024-11-01T16:48:27.2598015Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:48:27.2598473Z Workgroup Max Size per Dimension: 2024-11-01T16:48:27.2598885Z x 1024(0x400) 2024-11-01T16:48:27.2599349Z y 1024(0x400) 2024-11-01T16:48:27.2599974Z z 1024(0x400) 2024-11-01T16:48:27.2601047Z Max Waves Per CU: 32(0x20) 2024-11-01T16:48:27.2602367Z Max Work-item Per CU: 2048(0x800) 2024-11-01T16:48:27.2603376Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:48:27.2604231Z Grid Max Size per Dimension: 2024-11-01T16:48:27.2604946Z x 4294967295(0xffffffff) 2024-11-01T16:48:27.2605788Z y 4294967295(0xffffffff) 2024-11-01T16:48:27.2606630Z z 4294967295(0xffffffff) 2024-11-01T16:48:27.2607557Z Max fbarriers/Workgrp: 32 2024-11-01T16:48:27.2608580Z Packet Processor uCode:: 83 2024-11-01T16:48:27.2609574Z SDMA engine uCode:: 8 2024-11-01T16:48:27.2610524Z IOMMU Support:: None 2024-11-01T16:48:27.2611357Z Pool Info: 2024-11-01T16:48:27.2611958Z Pool 1 2024-11-01T16:48:27.2612743Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:48:27.2613741Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:48:27.2614716Z Allocatable: TRUE 2024-11-01T16:48:27.2616022Z Alloc Granule: 4KB 2024-11-01T16:48:27.2617078Z Alloc Recommended Granule:2048KB 2024-11-01T16:48:27.2618255Z Alloc Alignment: 4KB 2024-11-01T16:48:27.2619278Z Accessible by all: FALSE 2024-11-01T16:48:27.2620120Z Pool 2 2024-11-01T16:48:27.2620919Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-11-01T16:48:27.2621893Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:48:27.2622821Z Allocatable: TRUE 2024-11-01T16:48:27.2623800Z Alloc Granule: 4KB 2024-11-01T16:48:27.2624832Z Alloc Recommended Granule:2048KB 2024-11-01T16:48:27.2625873Z Alloc Alignment: 4KB 
2024-11-01T16:48:27.2626876Z Accessible by all: FALSE 2024-11-01T16:48:27.2627713Z Pool 3 2024-11-01T16:48:27.2628518Z Segment: GROUP 2024-11-01T16:48:27.2629577Z Size: 64(0x40) KB 2024-11-01T16:48:27.2630680Z Allocatable: FALSE 2024-11-01T16:48:27.2631810Z Alloc Granule: 0KB 2024-11-01T16:48:27.2633035Z Alloc Recommended Granule:0KB 2024-11-01T16:48:27.2633902Z Alloc Alignment: 0KB 2024-11-01T16:48:27.2634682Z Accessible by all: FALSE 2024-11-01T16:48:27.2635344Z ISA Info: 2024-11-01T16:48:27.2635680Z ISA 1 2024-11-01T16:48:27.2636248Z Name: amdgcn-amd-amdhsa--gfx90a:sramecc+:xnack- 2024-11-01T16:48:27.2636854Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-11-01T16:48:27.2637384Z Profiles: HSA_PROFILE_BASE 2024-11-01T16:48:27.2637909Z Default Rounding Mode: NEAR 2024-11-01T16:48:27.2638439Z Default Rounding Mode: NEAR 2024-11-01T16:48:27.2638955Z Fast f16: TRUE 2024-11-01T16:48:27.2639465Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:48:27.2639930Z Workgroup Max Size per Dimension: 2024-11-01T16:48:27.2640374Z x 1024(0x400) 2024-11-01T16:48:27.2640842Z y 1024(0x400) 2024-11-01T16:48:27.2641311Z z 1024(0x400) 2024-11-01T16:48:27.2641827Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:48:27.2642295Z Grid Max Size per Dimension: 2024-11-01T16:48:27.2642702Z x 4294967295(0xffffffff) 2024-11-01T16:48:27.2643151Z y 4294967295(0xffffffff) 2024-11-01T16:48:27.2643625Z z 4294967295(0xffffffff) 2024-11-01T16:48:27.2644122Z FBarrier Max Size: 32 2024-11-01T16:48:27.2644566Z *** Done *** 2024-11-01T16:48:27.2678277Z ##[group]Run ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2024-11-01T16:48:27.2678844Z ngpu=$(rocminfo | grep -c -E 'Name:.*\sgfx') 2024-11-01T16:48:27.2680254Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. 
Include a link to the runner logs so the runner can be identified" 2024-11-01T16:48:27.2681811Z if [[ $ngpu -eq 0 ]]; then 2024-11-01T16:48:27.2682794Z  echo "Error: Failed to detect any GPUs on the runner" 2024-11-01T16:48:27.2683514Z  echo "$msg" 2024-11-01T16:48:27.2683973Z  exit 1 2024-11-01T16:48:27.2684385Z fi 2024-11-01T16:48:27.2684782Z if [[ $ngpu -eq 1 ]]; then 2024-11-01T16:48:27.2685671Z  echo "Error: only 1 GPU detected, at least 2 GPUs are needed for distributed jobs" 2024-11-01T16:48:27.2686570Z  echo "$msg" 2024-11-01T16:48:27.2687011Z  exit 1 2024-11-01T16:48:27.2687417Z fi 2024-11-01T16:48:27.2712608Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.2713373Z env: 2024-11-01T16:48:27.2713681Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.2714111Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.2714528Z ##[endgroup] 2024-11-01T16:48:27.3731444Z Prepare all required actions 2024-11-01T16:48:27.3800190Z ##[group]Run ./.github/actions/diskspace-cleanup 2024-11-01T16:48:27.3800980Z with: 2024-11-01T16:48:27.3801467Z diskspace-cutoff: 70 2024-11-01T16:48:27.3802036Z env: 2024-11-01T16:48:27.3802513Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.3803204Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.3803993Z ##[endgroup] 2024-11-01T16:48:27.3919334Z ##[group]Run set -ex 2024-11-01T16:48:27.3919845Z set -ex 2024-11-01T16:48:27.3920522Z diskspace_cutoff=70 2024-11-01T16:48:27.3921591Z docker_root_dir=$(docker info -f '{{.DockerRootDir}}') 2024-11-01T16:48:27.3923157Z diskspace=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-11-01T16:48:27.3925428Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. 
Include a link to the runner logs so the runner can be identified" 2024-11-01T16:48:27.3927699Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-11-01T16:48:27.3928650Z  docker system prune -af 2024-11-01T16:48:27.3929980Z  diskspace_new=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-11-01T16:48:27.3931470Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-11-01T16:48:27.3933031Z  echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 2024-11-01T16:48:27.3934346Z  echo "$msg" 2024-11-01T16:48:27.3934976Z  exit 1 2024-11-01T16:48:27.3935557Z  else 2024-11-01T16:48:27.3936258Z  difference=$((diskspace - diskspace_new)) 2024-11-01T16:48:27.3937289Z  echo "Diskspace saved: $difference percent" 2024-11-01T16:48:27.3938363Z  fi 2024-11-01T16:48:27.3938888Z fi 2024-11-01T16:48:27.3972169Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.3973087Z env: 2024-11-01T16:48:27.3973583Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.3974303Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.3975121Z ##[endgroup] 2024-11-01T16:48:27.4038010Z + diskspace_cutoff=70 2024-11-01T16:48:27.4044657Z ++ docker info -f '{{.DockerRootDir}}' 2024-11-01T16:48:27.4582220Z + docker_root_dir=/home/pytorchci/.local/share/docker 2024-11-01T16:48:27.4590296Z ++ df -H --output=pcent /home/pytorchci/.local/share/docker 2024-11-01T16:48:27.4590873Z ++ sed -n 2p 2024-11-01T16:48:27.4591530Z ++ sed s/%// 2024-11-01T16:48:27.4593201Z ++ sed 's/ //' 2024-11-01T16:48:27.4607599Z + diskspace=21 2024-11-01T16:48:27.4608602Z + msg='Please file an issue on pytorch/pytorch reporting the faulty runner. 
Include a link to the runner logs so the runner can be identified' 2024-11-01T16:48:27.4609718Z + [[ 21 -ge 70 ]] 2024-11-01T16:48:27.4653564Z ##[group]Run env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-11-01T16:48:27.4654580Z env | grep '^GITHUB' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-11-01T16:48:27.4655470Z env | grep '^CI' >> "/tmp/github_env_${GITHUB_RUN_ID}" 2024-11-01T16:48:27.4682032Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.4682746Z env: 2024-11-01T16:48:27.4683138Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.4683701Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.4684327Z ##[endgroup] 2024-11-01T16:48:27.4800314Z ##[group]Run # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2024-11-01T16:48:27.4801095Z # All GPUs are visible to the runner; visibility, if needed, will be set by run_test.py. 2024-11-01T16:48:27.4801946Z echo "GPU_FLAG=--device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon" >> "${GITHUB_ENV}" 2024-11-01T16:48:27.4817347Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:27.4817749Z env: 2024-11-01T16:48:27.4817991Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.4818416Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.4818818Z ##[endgroup] 2024-11-01T16:48:27.4981179Z ##[group]Run aws-actions/configure-aws-credentials@v4 2024-11-01T16:48:27.4982101Z with: 2024-11-01T16:48:27.4982991Z role-to-assume: arn:aws:iam::308535385114:role/gha_workflow_s3_and_ecr_read_only 2024-11-01T16:48:27.4984088Z aws-region: us-east-1 2024-11-01T16:48:27.4984703Z role-duration-seconds: 18000 2024-11-01T16:48:27.4985381Z audience: sts.amazonaws.com 2024-11-01T16:48:27.4985985Z env: 2024-11-01T16:48:27.4986501Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:27.4987343Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:27.4988899Z GPU_FLAG: --device=/dev/mem 
--device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:27.4990309Z ##[endgroup] 2024-11-01T16:48:27.9588065Z Assuming role with OIDC 2024-11-01T16:48:28.3086712Z Authenticated as assumedRoleId AROAUPVRELQNLLCOPFEJR:GitHubActions 2024-11-01T16:48:28.4246179Z ##[group]Run aws-actions/amazon-ecr-login@v2 2024-11-01T16:48:28.4247039Z with: 2024-11-01T16:48:28.4247589Z mask-password: true 2024-11-01T16:48:28.4248246Z registry-type: private 2024-11-01T16:48:28.4248904Z skip-logout: false 2024-11-01T16:48:28.4249472Z env: 2024-11-01T16:48:28.4249982Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:28.4250734Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:28.4252102Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:28.4253375Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:28.4254070Z AWS_REGION: us-east-1 2024-11-01T16:48:28.4255380Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:28.4256413Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:28.4268723Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:28.4269371Z ##[endgroup] 2024-11-01T16:48:28.9320061Z Logging into registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:29.6598508Z ##[group]Run pytorch/test-infra/.github/actions/calculate-docker-image@main 2024-11-01T16:48:29.6599398Z with: 2024-11-01T16:48:29.6600632Z docker-image-name: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.6601996Z docker-build-dir: .ci/docker 2024-11-01T16:48:29.6602550Z working-directory: . 
2024-11-01T16:48:29.6603226Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:29.6603969Z force-push: false 2024-11-01T16:48:29.6604413Z env: 2024-11-01T16:48:29.6604815Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:29.6605397Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:29.6606443Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:29.6607418Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:29.6607957Z AWS_REGION: us-east-1 2024-11-01T16:48:29.6608761Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:29.6609580Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:29.6619227Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:29.6620055Z ##[endgroup] 2024-11-01T16:48:29.6652744Z ##[group]Run set -ex 2024-11-01T16:48:29.6653264Z set -ex 2024-11-01T16:48:29.6653705Z  2024-11-01T16:48:29.6654494Z # If the docker build directory or the build script doesn't exist, the action will 2024-11-01T16:48:29.6655786Z # gracefully return the docker image name as it is. Pulling docker image in Linux 2024-11-01T16:48:29.6656883Z # job could then download the pre-built image as usual 2024-11-01T16:48:29.6657905Z if [[ ! -d "${DOCKER_BUILD_DIR}" ]] || [[ ! -f "${DOCKER_BUILD_DIR}/build.sh" ]]; then 2024-11-01T16:48:29.6658994Z  echo "skip=true" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6659864Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6660640Z  2024-11-01T16:48:29.6661364Z  echo "There is no Docker build script in ${REPO_NAME} repo, skipping..." 
2024-11-01T16:48:29.6662239Z  exit 0 2024-11-01T16:48:29.6662699Z else 2024-11-01T16:48:29.6663228Z  echo "skip=false" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6663858Z fi 2024-11-01T16:48:29.6664265Z  2024-11-01T16:48:29.6664927Z if [[ "${DOCKER_IMAGE_NAME}" == *"${DOCKER_REGISTRY}/${REPO_NAME}"* ]]; then 2024-11-01T16:48:29.6666087Z  # The docker image name already includes the ECR prefix and tag, so we can just 2024-11-01T16:48:29.6667141Z  # use it as it is, but first let's extract the tag 2024-11-01T16:48:29.6668097Z  DOCKER_TAG=$(echo "${DOCKER_IMAGE_NAME}" | awk -F '[:,]' '{print $2}') 2024-11-01T16:48:29.6669129Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6670254Z  echo "docker-image=${DOCKER_IMAGE_NAME}" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6671167Z else 2024-11-01T16:48:29.6671900Z  DOCKER_TAG=$(git rev-parse HEAD:"${DOCKER_BUILD_DIR}") 2024-11-01T16:48:29.6672947Z  echo "docker-tag=${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6674370Z  echo "docker-image=${DOCKER_REGISTRY}/${REPO_NAME}/${DOCKER_IMAGE_NAME}:${DOCKER_TAG}" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6675416Z fi 2024-11-01T16:48:29.6702498Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:29.6703213Z env: 2024-11-01T16:48:29.6703629Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:29.6704234Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:29.6705280Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:29.6706265Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:29.6706818Z AWS_REGION: us-east-1 2024-11-01T16:48:29.6707405Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:29.6708104Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:29.6722724Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:29.6723273Z REPO_NAME: pytorch 2024-11-01T16:48:29.6724508Z DOCKER_IMAGE_NAME: 
308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.6725842Z DOCKER_BUILD_DIR: .ci/docker 2024-11-01T16:48:29.6726533Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:29.6727262Z ##[endgroup] 2024-11-01T16:48:29.6786843Z + [[ ! -d .ci/docker ]] 2024-11-01T16:48:29.6787463Z + [[ ! -f .ci/docker/build.sh ]] 2024-11-01T16:48:29.6788036Z + echo skip=false 2024-11-01T16:48:29.6790188Z + [[ 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 == *\3\0\8\5\3\5\3\8\5\1\1\4\.\d\k\r\.\e\c\r\.\u\s\-\e\a\s\t\-\1\.\a\m\a\z\o\n\a\w\s\.\c\o\m\/\p\y\t\o\r\c\h* ]] 2024-11-01T16:48:29.6804387Z ++ echo 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.6806298Z ++ awk -F '[:,]' '{print $2}' 2024-11-01T16:48:29.6840179Z + DOCKER_TAG=bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.6841617Z + echo docker-tag=bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.6842787Z + echo docker-image=308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.6914934Z ##[group]Run set +e 2024-11-01T16:48:29.6915630Z set +e 2024-11-01T16:48:29.6916193Z set -x 2024-11-01T16:48:29.6916728Z  2024-11-01T16:48:29.6917262Z login() { 2024-11-01T16:48:29.6918483Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2024-11-01T16:48:29.6919765Z } 2024-11-01T16:48:29.6920304Z  2024-11-01T16:48:29.6920834Z retry () { 2024-11-01T16:48:29.6921601Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2024-11-01T16:48:29.6922420Z } 2024-11-01T16:48:29.6922930Z  2024-11-01T16:48:29.6923521Z retry login "${DOCKER_REGISTRY}" 2024-11-01T16:48:29.6924308Z  2024-11-01T16:48:29.6925168Z # Check if image 
already exists, if it does then skip building it 2024-11-01T16:48:29.6926441Z if docker manifest inspect "${DOCKER_IMAGE}"; then 2024-11-01T16:48:29.6927354Z  exit 0 2024-11-01T16:48:29.6927916Z fi 2024-11-01T16:48:29.6928428Z  2024-11-01T16:48:29.6929341Z # NB: This part requires a full checkout. Otherwise, the merge base will 2024-11-01T16:48:29.6930823Z # be empty. The default action would be to continue rebuild the image 2024-11-01T16:48:29.6932138Z if [[ "$BASE_REVISION" = "$(git rev-parse HEAD)" ]]; then 2024-11-01T16:48:29.6933353Z  # if we're on the base branch then use the parent commit 2024-11-01T16:48:29.6934427Z  MERGE_BASE=$(git rev-parse HEAD~) 2024-11-01T16:48:29.6935232Z else 2024-11-01T16:48:29.6936093Z  # otherwise we're on a PR, so use the most recent base commit 2024-11-01T16:48:29.6937352Z  MERGE_BASE=$(git merge-base HEAD "$BASE_REVISION") 2024-11-01T16:48:29.6938439Z fi 2024-11-01T16:48:29.6938979Z  2024-11-01T16:48:29.6939575Z if [[ -z "${MERGE_BASE}" ]]; then 2024-11-01T16:48:29.6940493Z  echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6941312Z  2024-11-01T16:48:29.6942484Z  echo "Finding merge base only works with full checkout, please set fetch-depth to 0, continuing ..." 2024-11-01T16:48:29.6943838Z  exit 0 2024-11-01T16:48:29.6944401Z fi 2024-11-01T16:48:29.6944912Z  2024-11-01T16:48:29.6945707Z if ! 
git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}"; then 2024-11-01T16:48:29.6947850Z  echo "Directory '${DOCKER_BUILD_DIR}' not found in commit $MERGE_BASE, you should rebase onto a more recent commit" 2024-11-01T16:48:29.6949312Z  exit 1 2024-11-01T16:48:29.6949870Z fi 2024-11-01T16:48:29.6950395Z  2024-11-01T16:48:29.6951285Z PREVIOUS_DOCKER_TAG=$(git rev-parse "${MERGE_BASE}:${DOCKER_BUILD_DIR}") 2024-11-01T16:48:29.6952983Z # If no image exists but the hash is the same as the previous hash then we should error out here 2024-11-01T16:48:29.6954535Z if [[ "${PREVIOUS_DOCKER_TAG}" == "${DOCKER_TAG}" ]]; then 2024-11-01T16:48:29.6956262Z  echo "WARNING: Something has gone wrong and the previous image isn't available for the merge-base of your branch" 2024-11-01T16:48:29.6958198Z  echo " Will re-build docker image to store in local cache, TTS may be longer" 2024-11-01T16:48:29.6959347Z fi 2024-11-01T16:48:29.6959863Z  2024-11-01T16:48:29.6960521Z echo "rebuild=true" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:29.6996091Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:29.6997037Z env: 2024-11-01T16:48:29.6997572Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:29.6998754Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:29.7000145Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:29.7001432Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:29.7002137Z AWS_REGION: us-east-1 2024-11-01T16:48:29.7002951Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:29.7003883Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:29.7015938Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:29.7016610Z DOCKER_BUILD_DIR: .ci/docker 2024-11-01T16:48:29.7017481Z BASE_REVISION: d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:48:29.7019525Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.7021351Z 
DOCKER_TAG: bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:29.7022444Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:29.7023405Z ##[endgroup] 2024-11-01T16:48:29.7095160Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:29.7096380Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:29.7102025Z + aws ecr get-login-password --region us-east-1 2024-11-01T16:48:29.7104092Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:31.0986889Z WARNING! Your password will be stored unencrypted in /home/pytorchci/.docker/config.json. 2024-11-01T16:48:31.0988522Z Configure a credential helper to remove this warning. See 2024-11-01T16:48:31.0990239Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-11-01T16:48:31.0991209Z 2024-11-01T16:48:31.0992378Z Login Succeeded 2024-11-01T16:48:31.1026184Z + docker manifest inspect 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:32.4468987Z { 2024-11-01T16:48:32.4469598Z "schemaVersion": 2, 2024-11-01T16:48:32.4470686Z "mediaType": "application/vnd.docker.distribution.manifest.v2+json", 2024-11-01T16:48:32.4471850Z "config": { 2024-11-01T16:48:32.4472734Z "mediaType": "application/vnd.docker.container.image.v1+json", 2024-11-01T16:48:32.4473737Z "size": 24320, 2024-11-01T16:48:32.4474773Z "digest": "sha256:4d076369ac6bdd477f80b455e0eafad40dbf413bb6bca80e5a8acefa29995600" 2024-11-01T16:48:32.4475914Z }, 2024-11-01T16:48:32.4476410Z "layers": [ 2024-11-01T16:48:32.4476931Z { 2024-11-01T16:48:32.4477735Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4479008Z "size": 28583948, 2024-11-01T16:48:32.4480168Z "digest": "sha256:86e5016c269355b382c9cabab4f6646d56d75914f20d545289970436dae431b1" 2024-11-01T16:48:32.4481483Z }, 
2024-11-01T16:48:32.4482031Z { 2024-11-01T16:48:32.4488054Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4488622Z "size": 1822, 2024-11-01T16:48:32.4489166Z "digest": "sha256:162b6138f2b03f11bc7d166adcec6d7edc9543aff23647976a7ccf9c9eb42e73" 2024-11-01T16:48:32.4489786Z }, 2024-11-01T16:48:32.4490316Z { 2024-11-01T16:48:32.4490733Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4491248Z "size": 312559684, 2024-11-01T16:48:32.4491773Z "digest": "sha256:f2772a057b93184d18e0a4ce2e0c7df9490978287bbaff68e6240a5de81d8827" 2024-11-01T16:48:32.4492341Z }, 2024-11-01T16:48:32.4492582Z { 2024-11-01T16:48:32.4492989Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4493500Z "size": 864, 2024-11-01T16:48:32.4494011Z "digest": "sha256:a07e96d4579e0cbb485fcdb25d0777d240f0cd0fd8222ce7cea1056c9fdeec1d" 2024-11-01T16:48:32.4494593Z }, 2024-11-01T16:48:32.4494834Z { 2024-11-01T16:48:32.4495237Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4495757Z "size": 106, 2024-11-01T16:48:32.4496254Z "digest": "sha256:fc4226a98863f023397c2c581c98b11b40e1b116bc177f3c1fe9faf0dad9a7e7" 2024-11-01T16:48:32.4497072Z }, 2024-11-01T16:48:32.4497319Z { 2024-11-01T16:48:32.4497720Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4498373Z "size": 704, 2024-11-01T16:48:32.4498894Z "digest": "sha256:98e488266c9caf9e952b8351e89c1c082371ecd2ed67df55f5c7ccb59180e23b" 2024-11-01T16:48:32.4499467Z }, 2024-11-01T16:48:32.4499707Z { 2024-11-01T16:48:32.4500110Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4500616Z "size": 1260, 2024-11-01T16:48:32.4501146Z "digest": "sha256:631b0545c1dc2d77efadfcdfae1fe1fa0edb9ea6be6f7233885e79aab09802a6" 2024-11-01T16:48:32.4501752Z }, 2024-11-01T16:48:32.4501992Z { 2024-11-01T16:48:32.4502404Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4502915Z "size": 3678, 2024-11-01T16:48:32.4503427Z "digest": "sha256:33613f42c2e6eec7e082c2c1119e142e3f9f04266c95cfb6f0b96baf51b99175" 2024-11-01T16:48:32.4503989Z }, 2024-11-01T16:48:32.4504240Z { 2024-11-01T16:48:32.4504642Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4505154Z "size": 1801, 2024-11-01T16:48:32.4505665Z "digest": "sha256:4f9ae8abe33756cfd7b80b90bd7b1af316c0394e58bf41e7c1586fa021e4509a" 2024-11-01T16:48:32.4506242Z }, 2024-11-01T16:48:32.4506479Z { 2024-11-01T16:48:32.4506880Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4507392Z "size": 702, 2024-11-01T16:48:32.4507898Z "digest": "sha256:f45ff9cb5ae341e9958b6a54478fa78323dcab64bcecf414e207f196a31530f2" 2024-11-01T16:48:32.4508843Z }, 2024-11-01T16:48:32.4509298Z { 2024-11-01T16:48:32.4510064Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4511053Z "size": 2684244768, 2024-11-01T16:48:32.4512061Z "digest": "sha256:79f90e3b35e3589cc7a225f3644dc07793ce58a89ee9d3930e5380de85858c8e" 2024-11-01T16:48:32.4513135Z }, 2024-11-01T16:48:32.4513585Z { 2024-11-01T16:48:32.4514436Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4515496Z "size": 380, 2024-11-01T16:48:32.4516461Z "digest": "sha256:26a1748cd9372ed1db6b0199fd5e837823c3d9b34e863ab4d8ff816af972f4e0" 2024-11-01T16:48:32.4517554Z }, 2024-11-01T16:48:32.4517998Z { 2024-11-01T16:48:32.4518926Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4519981Z "size": 12113, 2024-11-01T16:48:32.4520942Z "digest": "sha256:46077999b29ee2cd69577fb7695cf83271c7e559726ec7c6f5a2d4c9f93eae42" 2024-11-01T16:48:32.4522006Z }, 2024-11-01T16:48:32.4522460Z { 2024-11-01T16:48:32.4523213Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4524173Z "size": 504, 
2024-11-01T16:48:32.4525441Z "digest": "sha256:298f94ba17808fb4fa481ebed4e657c49d2f8d6e2de4d6c64f4ce4a5ddac5839" 2024-11-01T16:48:32.4526539Z }, 2024-11-01T16:48:32.4526988Z { 2024-11-01T16:48:32.4527745Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4528734Z "size": 121477455, 2024-11-01T16:48:32.4529775Z "digest": "sha256:b5de01ce3c49d8ea251ba514ecd44d5a1990afd04c6ae45b9a7ab35edad56038" 2024-11-01T16:48:32.4530901Z }, 2024-11-01T16:48:32.4531359Z { 2024-11-01T16:48:32.4532150Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4533120Z "size": 109, 2024-11-01T16:48:32.4534053Z "digest": "sha256:545e959ed0e880eef71900f4d98644d203829af6579697320009d114c075969a" 2024-11-01T16:48:32.4535135Z }, 2024-11-01T16:48:32.4535580Z { 2024-11-01T16:48:32.4536357Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4537346Z "size": 490, 2024-11-01T16:48:32.4538537Z "digest": "sha256:0d6cd48338a2b75af7392982c4550ec831a096d07be2cc4b53e9f7a6fdcd04cc" 2024-11-01T16:48:32.4539857Z }, 2024-11-01T16:48:32.4540383Z { 2024-11-01T16:48:32.4541253Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4542531Z "size": 297, 2024-11-01T16:48:32.4543477Z "digest": "sha256:e189d4f6677473e231661cf42501a78bfc04fd8a55a429e6330fa7223d8fd8f2" 2024-11-01T16:48:32.4544556Z }, 2024-11-01T16:48:32.4545005Z { 2024-11-01T16:48:32.4545767Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4546737Z "size": 103, 2024-11-01T16:48:32.4547702Z "digest": "sha256:d6802c7f20ed5257008c8dc9d18c79c1196d1b1b390ccb307c9a0508b833b105" 2024-11-01T16:48:32.4548842Z }, 2024-11-01T16:48:32.4549377Z { 2024-11-01T16:48:32.4550300Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4551459Z "size": 1473, 2024-11-01T16:48:32.4552609Z "digest": "sha256:56a128ca4cdc55990dc1622c87d1c6b7de79c31829cc25560f9f020e903900c4" 
2024-11-01T16:48:32.4553915Z }, 2024-11-01T16:48:32.4554383Z { 2024-11-01T16:48:32.4555149Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4556138Z "size": 426816767, 2024-11-01T16:48:32.4557151Z "digest": "sha256:777a3fad188d9b3d7842e6408fefb18952ba4c9cfd7b6d1615c6907804577114" 2024-11-01T16:48:32.4558258Z }, 2024-11-01T16:48:32.4558717Z { 2024-11-01T16:48:32.4559568Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4560732Z "size": 163, 2024-11-01T16:48:32.4561720Z "digest": "sha256:7bd376e7570dd04b940d25fa18e82cede2981f05cbc3772fc0041165174ca2d2" 2024-11-01T16:48:32.4562804Z }, 2024-11-01T16:48:32.4563259Z { 2024-11-01T16:48:32.4564029Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4564986Z "size": 1640, 2024-11-01T16:48:32.4565979Z "digest": "sha256:7beadf9babec249728f9f1f97764bab920f82b6dbfd6216150ab34aa6524f875" 2024-11-01T16:48:32.4567105Z }, 2024-11-01T16:48:32.4567573Z { 2024-11-01T16:48:32.4568411Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4569618Z "size": 7842049879, 2024-11-01T16:48:32.4570805Z "digest": "sha256:d09605f5be3506ee239f9b2fd1080b1a94c9395e1d26592335204b510ba32987" 2024-11-01T16:48:32.4571954Z }, 2024-11-01T16:48:32.4572414Z { 2024-11-01T16:48:32.4573184Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4574169Z "size": 105, 2024-11-01T16:48:32.4575153Z "digest": "sha256:23d9b59fc2d6ba444e4e9dcda52e784e75d092b74bcd343fe6449963eb93d0a6" 2024-11-01T16:48:32.4576264Z }, 2024-11-01T16:48:32.4576719Z { 2024-11-01T16:48:32.4577485Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4578612Z "size": 974, 2024-11-01T16:48:32.4579567Z "digest": "sha256:32757d535f12983953ed7fa3e6c9105c523cc59f9de46009d99d2006effd9095" 2024-11-01T16:48:32.4580665Z }, 2024-11-01T16:48:32.4581114Z { 2024-11-01T16:48:32.4582170Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4583168Z "size": 318526641, 2024-11-01T16:48:32.4584152Z "digest": "sha256:176796974e42836ae83a90fd8a36f1c55314712beb7305e1b61fe62d90c9461a" 2024-11-01T16:48:32.4585226Z }, 2024-11-01T16:48:32.4585681Z { 2024-11-01T16:48:32.4586442Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4587412Z "size": 111, 2024-11-01T16:48:32.4588472Z "digest": "sha256:a1fb76efb5786598c6e880d39d27ce17817e0dcd5906512cb018e2892964d682" 2024-11-01T16:48:32.4589768Z }, 2024-11-01T16:48:32.4590298Z { 2024-11-01T16:48:32.4591192Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4592347Z "size": 1829, 2024-11-01T16:48:32.4593468Z "digest": "sha256:fb133b23798c65c183654eb1d111cbe9d143d29870b86f8e43580b5b69e2caa9" 2024-11-01T16:48:32.4594552Z }, 2024-11-01T16:48:32.4595006Z { 2024-11-01T16:48:32.4595786Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4596780Z "size": 619478538, 2024-11-01T16:48:32.4597759Z "digest": "sha256:0569484aa5b4385d29ef9490a19b583b672100a675957ac38746fb077b759849" 2024-11-01T16:48:32.4599288Z }, 2024-11-01T16:48:32.4599822Z { 2024-11-01T16:48:32.4600703Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4601687Z "size": 167, 2024-11-01T16:48:32.4602646Z "digest": "sha256:eb69f898c8797ad483631ea108e76e54259cc907a09784dc8857a06a3ffe0e57" 2024-11-01T16:48:32.4603725Z }, 2024-11-01T16:48:32.4604175Z { 2024-11-01T16:48:32.4604951Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4605930Z "size": 2854540, 2024-11-01T16:48:32.4606919Z "digest": "sha256:283525ceebdf5dc275b9425fc557b2b8d424b8c3a9e2a3c073ff335160103d01" 2024-11-01T16:48:32.4607993Z }, 2024-11-01T16:48:32.4608513Z { 2024-11-01T16:48:32.4609460Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4610600Z "size": 107, 
2024-11-01T16:48:32.4611636Z "digest": "sha256:0d1abd111f4dd717dd3dcad5c3952dfed808be7bf1ef590c0678aa94b46c1c14" 2024-11-01T16:48:32.4612744Z }, 2024-11-01T16:48:32.4613202Z { 2024-11-01T16:48:32.4613964Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4614934Z "size": 566, 2024-11-01T16:48:32.4615895Z "digest": "sha256:b4868ebe0605763563ea0bfae50d1c902ab28491ff1d79d6b52f5f3ae35cf910" 2024-11-01T16:48:32.4616987Z }, 2024-11-01T16:48:32.4617447Z { 2024-11-01T16:48:32.4618383Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4619566Z "size": 43194130, 2024-11-01T16:48:32.4620759Z "digest": "sha256:9fc532b0af2f61d2359cd94f813d79a2adcdee9abf19791554de15d0af60f989" 2024-11-01T16:48:32.4622059Z }, 2024-11-01T16:48:32.4622598Z { 2024-11-01T16:48:32.4623446Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4624411Z "size": 106, 2024-11-01T16:48:32.4625390Z "digest": "sha256:90c3c714af3689cefd8bcc0b6d59b5c573a87b2c9795e2084ef418c8b3404c38" 2024-11-01T16:48:32.4626473Z }, 2024-11-01T16:48:32.4626923Z { 2024-11-01T16:48:32.4627694Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4628659Z "size": 294, 2024-11-01T16:48:32.4629601Z "digest": "sha256:6417650b4dec807ec6d46758c24117f43e85f3a9e1028cc354df4824245109c5" 2024-11-01T16:48:32.4630675Z }, 2024-11-01T16:48:32.4631124Z { 2024-11-01T16:48:32.4631892Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4632861Z "size": 88286, 2024-11-01T16:48:32.4633864Z "digest": "sha256:6fc4dbdf12d4bce3c6e44af38173e6e22944b6eb8dd8e6fc21e752640db877ec" 2024-11-01T16:48:32.4634966Z }, 2024-11-01T16:48:32.4635411Z { 2024-11-01T16:48:32.4636163Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4637119Z "size": 106, 2024-11-01T16:48:32.4638411Z "digest": "sha256:15bc9b64c61392bff9a10be3fe8d69c90559b10d607eaf23c747c9da5772b8b0" 
2024-11-01T16:48:32.4639717Z }, 2024-11-01T16:48:32.4640246Z { 2024-11-01T16:48:32.4641132Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4642111Z "size": 1402, 2024-11-01T16:48:32.4643095Z "digest": "sha256:9bbd022454bb3dc9fb6690e011a9aa0784f8ea91ff4eda4c642ff112568b2b03" 2024-11-01T16:48:32.4644198Z }, 2024-11-01T16:48:32.4644651Z { 2024-11-01T16:48:32.4645409Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4646378Z "size": 702, 2024-11-01T16:48:32.4647343Z "digest": "sha256:f45ff9cb5ae341e9958b6a54478fa78323dcab64bcecf414e207f196a31530f2" 2024-11-01T16:48:32.4648441Z }, 2024-11-01T16:48:32.4648891Z { 2024-11-01T16:48:32.4649647Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4650660Z "size": 136, 2024-11-01T16:48:32.4651630Z "digest": "sha256:614bea1c0a739f0f037eb961dd827afa53e913e8de6588bb66907080f5ffa175" 2024-11-01T16:48:32.4652723Z }, 2024-11-01T16:48:32.4653177Z { 2024-11-01T16:48:32.4653934Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4655245Z "size": 120, 2024-11-01T16:48:32.4656200Z "digest": "sha256:9dce8d080268c9c364e311cfacd73d963991e3dc2684d910c3b5e2588fe30676" 2024-11-01T16:48:32.4657274Z }, 2024-11-01T16:48:32.4657724Z { 2024-11-01T16:48:32.4658633Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4659603Z "size": 4434541507, 2024-11-01T16:48:32.4660601Z "digest": "sha256:7f3712a06f75e7c6131cebf428acf3d45011d22d83864168c9eb376a8cc792b4" 2024-11-01T16:48:32.4661675Z }, 2024-11-01T16:48:32.4662124Z { 2024-11-01T16:48:32.4662876Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4663842Z "size": 176, 2024-11-01T16:48:32.4664803Z "digest": "sha256:5ca64a90cefdf08096b2c7732e5e85b972d5e501cd85258dea602f90e9183435" 2024-11-01T16:48:32.4665890Z }, 2024-11-01T16:48:32.4666349Z { 2024-11-01T16:48:32.4667107Z "mediaType": 
"application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4668105Z "size": 209, 2024-11-01T16:48:32.4669273Z "digest": "sha256:ee728b2ce5abe76b5f8b1f1128107ea80d50f82ea0c464de1aeea4ae9f66c575" 2024-11-01T16:48:32.4670589Z }, 2024-11-01T16:48:32.4671119Z { 2024-11-01T16:48:32.4672017Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4673176Z "size": 702, 2024-11-01T16:48:32.4674151Z "digest": "sha256:f45ff9cb5ae341e9958b6a54478fa78323dcab64bcecf414e207f196a31530f2" 2024-11-01T16:48:32.4675253Z }, 2024-11-01T16:48:32.4675703Z { 2024-11-01T16:48:32.4676460Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4677430Z "size": 633, 2024-11-01T16:48:32.4678371Z "digest": "sha256:725423b7ca2ca9294a492201aef72a0f9c0604ccf17d0d03a2133a0488b82f4a" 2024-11-01T16:48:32.4679658Z }, 2024-11-01T16:48:32.4680184Z { 2024-11-01T16:48:32.4681096Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4682099Z "size": 652359953, 2024-11-01T16:48:32.4683088Z "digest": "sha256:e5f2017e4c0250718d09dd7cc301e8304ec9bba476ae08ef5822ea315f37c0c7" 2024-11-01T16:48:32.4684187Z }, 2024-11-01T16:48:32.4684640Z { 2024-11-01T16:48:32.4685392Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4686363Z "size": 2047, 2024-11-01T16:48:32.4687329Z "digest": "sha256:34af416b9973175fefc767b712903fdc05ba8d9e689811a83e5dfd226d27be4d" 2024-11-01T16:48:32.4688397Z }, 2024-11-01T16:48:32.4688856Z { 2024-11-01T16:48:32.4689617Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4690582Z "size": 15983068, 2024-11-01T16:48:32.4691576Z "digest": "sha256:9a372539a5bdeb26050d5f5b507262e52cb86c8bc4965568f3dfbe6e7f1f5d15" 2024-11-01T16:48:32.4692641Z }, 2024-11-01T16:48:32.4693086Z { 2024-11-01T16:48:32.4694141Z "mediaType": "application/vnd.docker.image.rootfs.diff.tar.gzip", 2024-11-01T16:48:32.4695111Z "size": 54145664, 
2024-11-01T16:48:32.4696081Z "digest": "sha256:f47e4fa885c56af74fac500b96a440d217a07080565c57cda42f0ae5b2891f4f" 2024-11-01T16:48:32.4697160Z } 2024-11-01T16:48:32.4697609Z ] 2024-11-01T16:48:32.4698063Z } 2024-11-01T16:48:32.4698683Z + exit 0 2024-11-01T16:48:32.4962403Z ##[group]Run pytorch/test-infra/.github/actions/pull-docker-image@main 2024-11-01T16:48:32.4963221Z with: 2024-11-01T16:48:32.4964375Z docker-image: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:32.4965851Z docker-registry: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:32.4966565Z env: 2024-11-01T16:48:32.4966963Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:32.4967549Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:32.4968572Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:32.4969541Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:32.4970071Z AWS_REGION: us-east-1 2024-11-01T16:48:32.4970828Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:32.4971850Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:32.4981091Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:32.4981582Z ##[endgroup] 2024-11-01T16:48:32.5008281Z ##[group]Run set -x 2024-11-01T16:48:32.5008760Z set -x 2024-11-01T16:48:32.5009185Z set +e 2024-11-01T16:48:32.5009583Z  2024-11-01T16:48:32.5009975Z login() { 2024-11-01T16:48:32.5010893Z  aws ecr get-login-password --region us-east-1 | docker login -u AWS --password-stdin "$1" 2024-11-01T16:48:32.5011858Z } 2024-11-01T16:48:32.5012241Z  2024-11-01T16:48:32.5012635Z retry () { 2024-11-01T16:48:32.5013178Z  $* || (sleep 1 && $*) || (sleep 2 && $*) 2024-11-01T16:48:32.5013808Z } 2024-11-01T16:48:32.5014196Z  2024-11-01T16:48:32.5014650Z retry login "${DOCKER_REGISTRY}" 2024-11-01T16:48:32.5015222Z  2024-11-01T16:48:32.5015597Z set -e 2024-11-01T16:48:32.5016263Z # ignore output since only exit code is used for 
conditional 2024-11-01T16:48:32.5017204Z # only pull docker image if it's not available locally 2024-11-01T16:48:32.5018425Z if ! docker inspect --type=image "${DOCKER_IMAGE}" >/dev/null 2>/dev/null; then 2024-11-01T16:48:32.5019372Z  retry docker pull "${DOCKER_IMAGE}" 2024-11-01T16:48:32.5019968Z fi 2024-11-01T16:48:32.5042324Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:32.5042716Z env: 2024-11-01T16:48:32.5042943Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:32.5043270Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:32.5043837Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:32.5044372Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:32.5044674Z AWS_REGION: us-east-1 2024-11-01T16:48:32.5044996Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:32.5045385Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:32.5050344Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:32.5051034Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:32.5051824Z DOCKER_REGISTRY: 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:32.5052226Z ##[endgroup] 2024-11-01T16:48:32.5093767Z + set +e 2024-11-01T16:48:32.5094287Z + retry login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:32.5094809Z + login 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:32.5103423Z + docker login -u AWS --password-stdin 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T16:48:32.5104029Z + aws ecr get-login-password --region us-east-1 2024-11-01T16:48:33.9210494Z WARNING! Your password will be stored unencrypted in /home/pytorchci/.docker/config.json. 2024-11-01T16:48:33.9212146Z Configure a credential helper to remove this warning. 
See 2024-11-01T16:48:33.9213911Z https://docs.docker.com/engine/reference/commandline/login/#credentials-store 2024-11-01T16:48:33.9214928Z 2024-11-01T16:48:33.9215202Z Login Succeeded 2024-11-01T16:48:33.9258915Z + set -e 2024-11-01T16:48:33.9261934Z + docker inspect --type=image 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:48:33.9529424Z ##[group]Run python3 -m pip install psutil==5.9.1 nvidia-ml-py==11.525.84 2024-11-01T16:48:33.9530876Z python3 -m pip install psutil==5.9.1 nvidia-ml-py==11.525.84 2024-11-01T16:48:33.9532158Z python3 -m tools.stats.monitor > usage_log.txt 2>&1 & 2024-11-01T16:48:33.9533345Z echo "monitor-script-pid=${!}" >> "${GITHUB_OUTPUT}" 2024-11-01T16:48:33.9564402Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:48:33.9565324Z env: 2024-11-01T16:48:33.9565849Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:33.9566626Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:33.9568007Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:33.9569753Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:33.9570458Z AWS_REGION: us-east-1 2024-11-01T16:48:33.9571281Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:33.9572218Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:33.9584520Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:33.9585163Z ##[endgroup] 2024-11-01T16:48:34.2156378Z Defaulting to user installation because normal site-packages is not writeable 2024-11-01T16:48:34.2882833Z Requirement already satisfied: psutil==5.9.1 in /home/pytorchci/.local/lib/python3.10/site-packages (5.9.1) 2024-11-01T16:48:34.2887120Z Requirement already satisfied: nvidia-ml-py==11.525.84 in /home/pytorchci/.local/lib/python3.10/site-packages (11.525.84) 2024-11-01T16:48:34.4407977Z Prepare all required actions 2024-11-01T16:48:34.4409326Z Getting action download info 2024-11-01T16:48:34.6435091Z 
Download action repository 'seemethere/download-artifact-s3@v4' (SHA:1da556a7aa0a088e3153970611f6c432d58e80e6) 2024-11-01T16:48:35.2604670Z Download action repository 'actions/download-artifact@v3' (SHA:9bc31d5ccc31df68ecc42ccf4149144866c47d8a) 2024-11-01T16:48:35.7533187Z ##[group]Run ./.github/actions/download-build-artifacts 2024-11-01T16:48:35.7533859Z with: 2024-11-01T16:48:35.7534289Z name: linux-focal-rocm6.2-py3.10 2024-11-01T16:48:35.7534857Z s3-bucket: gha-artifacts 2024-11-01T16:48:35.7535321Z env: 2024-11-01T16:48:35.7535709Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:35.7536276Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:35.7537313Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:35.7538453Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:35.7539044Z AWS_REGION: us-east-1 2024-11-01T16:48:35.7539660Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:35.7540369Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:35.7549591Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:35.7550092Z ##[endgroup] 2024-11-01T16:48:35.7630009Z ##[group]Run seemethere/download-artifact-s3@v4 2024-11-01T16:48:35.7630387Z with: 2024-11-01T16:48:35.7630642Z name: linux-focal-rocm6.2-py3.10 2024-11-01T16:48:35.7630974Z s3-bucket: gha-artifacts 2024-11-01T16:48:35.7631277Z region: us-east-1 2024-11-01T16:48:35.7631522Z env: 2024-11-01T16:48:35.7631760Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:48:35.7632095Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:48:35.7632673Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:48:35.7633236Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:48:35.7633543Z AWS_REGION: us-east-1 2024-11-01T16:48:35.7633931Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:48:35.7634348Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:48:35.7639311Z AWS_SESSION_TOKEN: *** 2024-11-01T16:48:35.7639599Z ##[endgroup] 2024-11-01T16:48:36.1748444Z 
(node:2661045) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2024-11-01T16:48:36.1749855Z 2024-11-01T16:48:36.1750375Z Please migrate your code to use AWS SDK for JavaScript (v3). 2024-11-01T16:48:36.1751704Z For more information, check the migration guide at https://a.co/7PzMCcy 2024-11-01T16:48:36.1753384Z (Use `node --trace-warnings ...` to show where the warning was created) 2024-11-01T16:48:36.5098769Z Found 1 objects with prefix pytorch/pytorch/11632515612/linux-focal-rocm6.2-py3.10/ 2024-11-01T16:48:36.5100767Z Starting download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/artifacts.zip 2024-11-01T16:50:05.1290089Z Finished download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/artifacts.zip 2024-11-01T16:50:05.1310613Z Artifact download has finished successfully 2024-11-01T16:50:05.1828877Z ##[group]Run unzip -o artifacts.zip 2024-11-01T16:50:05.1829838Z unzip -o artifacts.zip 2024-11-01T16:50:05.1865242Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:05.1866827Z env: 2024-11-01T16:50:05.1867461Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:05.1868398Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:05.1870096Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:05.1871620Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:05.1872334Z AWS_REGION: us-east-1 2024-11-01T16:50:05.1873119Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:05.1874057Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:05.1887134Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:05.1887791Z ##[endgroup] 2024-11-01T16:50:05.1984248Z Archive: artifacts.zip 2024-11-01T16:50:05.1987196Z creating: dist/ 2024-11-01T16:50:09.7680750Z inflating: dist/torch-2.6.0a0+gitd1aa4ef-cp310-cp310-linux_x86_64.whl 2024-11-01T16:50:09.7804436Z inflating: dist/.ninja_log 2024-11-01T16:50:09.7805401Z creating: build/custom_test_artifacts/ 
2024-11-01T16:50:09.7806660Z creating: build/custom_test_artifacts/custom-op-build/ 2024-11-01T16:50:09.7808258Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/ 2024-11-01T16:50:09.7809195Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/pkgRedirects/ 2024-11-01T16:50:09.7810132Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeConfigureLog.yaml 2024-11-01T16:50:09.7811107Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/ 2024-11-01T16:50:09.7812162Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-11-01T16:50:09.7813136Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-11-01T16:50:09.7814059Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-11-01T16:50:09.7815105Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-11-01T16:50:09.7816147Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-11-01T16:50:09.7817096Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-11-01T16:50:09.7818057Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-11-01T16:50:09.7819292Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-11-01T16:50:09.7820385Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-11-01T16:50:09.7821445Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-11-01T16:50:09.7822514Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-11-01T16:50:09.7823611Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-11-01T16:50:09.7824690Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-11-01T16:50:09.7825618Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeScratch/ 2024-11-01T16:50:09.7826432Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeTmp/ 2024-11-01T16:50:09.7827307Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/cmake.check_cache 2024-11-01T16:50:09.7828362Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/ 2024-11-01T16:50:09.7829374Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.ts 2024-11-01T16:50:09.7830448Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/compiler_depend.make 2024-11-01T16:50:09.7831486Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/depend.make 2024-11-01T16:50:09.7832449Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/link.txt 2024-11-01T16:50:09.7833699Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/cmake_clean.cmake 2024-11-01T16:50:09.7834707Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/build.make 2024-11-01T16:50:09.7835715Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/DependInfo.cmake 2024-11-01T16:50:09.7836717Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/flags.make 2024-11-01T16:50:09.7837929Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/progress.make 2024-11-01T16:50:09.7848825Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o.d 2024-11-01T16:50:09.7982244Z inflating: 
build/custom_test_artifacts/custom-op-build/CMakeFiles/custom_ops.dir/op.cpp.o 2024-11-01T16:50:09.7984349Z creating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/ 2024-11-01T16:50:09.7986417Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.ts 2024-11-01T16:50:09.7988574Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/compiler_depend.make 2024-11-01T16:50:09.7989666Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/depend.make 2024-11-01T16:50:09.7990688Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/link.txt 2024-11-01T16:50:09.7991744Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/cmake_clean.cmake 2024-11-01T16:50:09.7992808Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/build.make 2024-11-01T16:50:09.7993877Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/DependInfo.cmake 2024-11-01T16:50:09.7994952Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/flags.make 2024-11-01T16:50:09.7996012Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/progress.make 2024-11-01T16:50:09.8005952Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o.d 2024-11-01T16:50:09.8081918Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/test_custom_ops.dir/test_custom_ops.cpp.o 2024-11-01T16:50:09.8084250Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-11-01T16:50:09.8086233Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/TargetDirectories.txt 2024-11-01T16:50:09.8088055Z extracting: build/custom_test_artifacts/custom-op-build/CMakeFiles/progress.marks 
2024-11-01T16:50:09.8089729Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile2 2024-11-01T16:50:09.8091367Z inflating: build/custom_test_artifacts/custom-op-build/CMakeFiles/Makefile.cmake 2024-11-01T16:50:09.8092970Z inflating: build/custom_test_artifacts/custom-op-build/hip_new_types.cc 2024-11-01T16:50:09.8094475Z inflating: build/custom_test_artifacts/custom-op-build/CMakeCache.txt 2024-11-01T16:50:09.8095904Z inflating: build/custom_test_artifacts/custom-op-build/Makefile 2024-11-01T16:50:09.8097375Z inflating: build/custom_test_artifacts/custom-op-build/cmake_install.cmake 2024-11-01T16:50:09.8196198Z inflating: build/custom_test_artifacts/custom-op-build/libcustom_ops.so 2024-11-01T16:50:09.8252605Z inflating: build/custom_test_artifacts/custom-op-build/test_custom_ops 2024-11-01T16:50:09.8254095Z creating: build/custom_test_artifacts/jit-hook-build/ 2024-11-01T16:50:09.8255453Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/ 2024-11-01T16:50:09.8257034Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/pkgRedirects/ 2024-11-01T16:50:09.8259101Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeConfigureLog.yaml 2024-11-01T16:50:09.8261296Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/ 2024-11-01T16:50:09.8262981Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-11-01T16:50:09.8264755Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-11-01T16:50:09.8266492Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-11-01T16:50:09.8268753Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-11-01T16:50:09.8270768Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-11-01T16:50:09.8272845Z creating: 
build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-11-01T16:50:09.8274648Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-11-01T16:50:09.8276725Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-11-01T16:50:09.8278783Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-11-01T16:50:09.8280818Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-11-01T16:50:09.8282843Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-11-01T16:50:09.8284920Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-11-01T16:50:09.8287021Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-11-01T16:50:09.8288786Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeScratch/ 2024-11-01T16:50:09.8290342Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeTmp/ 2024-11-01T16:50:09.8291955Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/cmake.check_cache 2024-11-01T16:50:09.8293663Z creating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/ 2024-11-01T16:50:09.8295589Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.ts 2024-11-01T16:50:09.8297718Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/compiler_depend.make 2024-11-01T16:50:09.8299960Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/depend.make 2024-11-01T16:50:09.8301901Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/link.txt 2024-11-01T16:50:09.8303902Z inflating: 
build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/cmake_clean.cmake 2024-11-01T16:50:09.8305894Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/build.make 2024-11-01T16:50:09.8307890Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/DependInfo.cmake 2024-11-01T16:50:09.8309882Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/flags.make 2024-11-01T16:50:09.8311851Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/progress.make 2024-11-01T16:50:09.8313965Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o.d 2024-11-01T16:50:09.8353988Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/test_jit_hooks.dir/test_jit_hooks.cpp.o 2024-11-01T16:50:09.8356202Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-11-01T16:50:09.8358156Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/TargetDirectories.txt 2024-11-01T16:50:09.8359917Z extracting: build/custom_test_artifacts/jit-hook-build/CMakeFiles/progress.marks 2024-11-01T16:50:09.8361941Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile2 2024-11-01T16:50:09.8363635Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeFiles/Makefile.cmake 2024-11-01T16:50:09.8365219Z inflating: build/custom_test_artifacts/jit-hook-build/hip_new_types.cc 2024-11-01T16:50:09.8366701Z inflating: build/custom_test_artifacts/jit-hook-build/CMakeCache.txt 2024-11-01T16:50:09.8368097Z inflating: build/custom_test_artifacts/jit-hook-build/Makefile 2024-11-01T16:50:09.8369544Z inflating: build/custom_test_artifacts/jit-hook-build/cmake_install.cmake 2024-11-01T16:50:09.8404158Z inflating: build/custom_test_artifacts/jit-hook-build/test_jit_hooks 2024-11-01T16:50:09.8405682Z creating: 
build/custom_test_artifacts/custom-backend-build/ 2024-11-01T16:50:09.8407423Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/ 2024-11-01T16:50:09.8409041Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/pkgRedirects/ 2024-11-01T16:50:09.8411028Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeConfigureLog.yaml 2024-11-01T16:50:09.8412835Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/ 2024-11-01T16:50:09.8414629Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeSystem.cmake 2024-11-01T16:50:09.8416512Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/ 2024-11-01T16:50:09.8418584Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/tmp/ 2024-11-01T16:50:09.8420714Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/CMakeCCompilerId.c 2024-11-01T16:50:09.8422828Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdC/a.out 2024-11-01T16:50:09.8424740Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/ 2024-11-01T16:50:09.8426651Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/tmp/ 2024-11-01T16:50:09.8428841Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/CMakeCXXCompilerId.cpp 2024-11-01T16:50:09.8431028Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CompilerIdCXX/a.out 2024-11-01T16:50:09.8433176Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_C.bin 2024-11-01T16:50:09.8435333Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCCompiler.cmake 2024-11-01T16:50:09.8437552Z inflating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeDetermineCompilerABI_CXX.bin 2024-11-01T16:50:09.8439752Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/3.26.4/CMakeCXXCompiler.cmake 2024-11-01T16:50:09.8441635Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeScratch/ 2024-11-01T16:50:09.8443314Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeTmp/ 2024-11-01T16:50:09.8445042Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/cmake.check_cache 2024-11-01T16:50:09.8446853Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/ 2024-11-01T16:50:09.8448903Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.ts 2024-11-01T16:50:09.8451164Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/compiler_depend.make 2024-11-01T16:50:09.8453340Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/depend.make 2024-11-01T16:50:09.8455394Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/link.txt 2024-11-01T16:50:09.8457523Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/cmake_clean.cmake 2024-11-01T16:50:09.8460225Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/build.make 2024-11-01T16:50:09.8462373Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/DependInfo.cmake 2024-11-01T16:50:09.8464487Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/flags.make 2024-11-01T16:50:09.8466626Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/progress.make 2024-11-01T16:50:09.8468897Z inflating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o.d 2024-11-01T16:50:09.8543614Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/custom_backend.dir/custom_backend.cpp.o 2024-11-01T16:50:09.8545904Z creating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/ 2024-11-01T16:50:09.8548080Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.ts 2024-11-01T16:50:09.8550436Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/compiler_depend.make 2024-11-01T16:50:09.8552703Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/depend.make 2024-11-01T16:50:09.8554866Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/link.txt 2024-11-01T16:50:09.8557099Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/cmake_clean.cmake 2024-11-01T16:50:09.8559355Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/build.make 2024-11-01T16:50:09.8561593Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/DependInfo.cmake 2024-11-01T16:50:09.8563816Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/flags.make 2024-11-01T16:50:09.8566012Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/progress.make 2024-11-01T16:50:09.8568429Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o.d 2024-11-01T16:50:09.8617617Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/test_custom_backend.dir/test_custom_backend.cpp.o 2024-11-01T16:50:09.8620153Z inflating: 
build/custom_test_artifacts/custom-backend-build/CMakeFiles/CMakeDirectoryInformation.cmake 2024-11-01T16:50:09.8622248Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/TargetDirectories.txt 2024-11-01T16:50:09.8624127Z extracting: build/custom_test_artifacts/custom-backend-build/CMakeFiles/progress.marks 2024-11-01T16:50:09.8625873Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile2 2024-11-01T16:50:09.8627633Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeFiles/Makefile.cmake 2024-11-01T16:50:09.8629308Z inflating: build/custom_test_artifacts/custom-backend-build/hip_new_types.cc 2024-11-01T16:50:09.8630892Z inflating: build/custom_test_artifacts/custom-backend-build/CMakeCache.txt 2024-11-01T16:50:09.8632402Z inflating: build/custom_test_artifacts/custom-backend-build/Makefile 2024-11-01T16:50:09.8633940Z inflating: build/custom_test_artifacts/custom-backend-build/cmake_install.cmake 2024-11-01T16:50:09.8713738Z inflating: build/custom_test_artifacts/custom-backend-build/libcustom_backend.so 2024-11-01T16:50:09.8752175Z inflating: build/custom_test_artifacts/custom-backend-build/test_custom_backend 2024-11-01T16:50:09.8753363Z creating: build/lib/ 2024-11-01T16:50:09.8832683Z inflating: build/lib/libprotobuf-lite.a 2024-11-01T16:50:09.9242737Z inflating: build/lib/libprotobuf.a 2024-11-01T16:50:09.9251404Z inflating: build/lib/libpthreadpool.a 2024-11-01T16:50:09.9705265Z inflating: build/lib/libprotoc.a 2024-11-01T16:50:09.9712439Z inflating: build/lib/libcpuinfo.a 2024-11-01T16:50:09.9719488Z inflating: build/lib/libcpuinfo_internals.a 2024-11-01T16:50:09.9720590Z inflating: build/lib/libclog.a 2024-11-01T16:50:09.9737410Z inflating: build/lib/libpytorch_qnnpack.a 2024-11-01T16:50:09.9738773Z inflating: build/lib/libnnpack_reference_layers.a 2024-11-01T16:50:09.9755621Z inflating: build/lib/libnnpack.a 2024-11-01T16:50:09.9762439Z inflating: build/lib/libittnotify.a 
2024-11-01T16:50:09.9786975Z inflating: build/lib/libtensorpipe_uv.a 2024-11-01T16:50:09.9845468Z inflating: build/lib/libgtest.a 2024-11-01T16:50:09.9847006Z inflating: build/lib/libgtest_main.a 2024-11-01T16:50:09.9913864Z inflating: build/lib/libbenchmark.a 2024-11-01T16:50:09.9914886Z inflating: build/lib/libbenchmark_main.a 2024-11-01T16:50:09.9930500Z inflating: build/lib/libgmock.a 2024-11-01T16:50:09.9987196Z inflating: build/lib/libasmjit.a 2024-11-01T16:50:09.9988185Z inflating: build/lib/libgmock_main.a 2024-11-01T16:50:10.1132496Z inflating: build/lib/libfbgemm.a 2024-11-01T16:50:10.1305619Z inflating: build/lib/libXNNPACK.a 2024-11-01T16:50:10.1825321Z inflating: build/lib/libtensorpipe.a 2024-11-01T16:50:10.1927342Z inflating: build/lib/libgloo.a 2024-11-01T16:50:10.1965864Z inflating: build/lib/libonnx_proto.a 2024-11-01T16:50:10.2321664Z inflating: build/lib/libgloo_hip.a 2024-11-01T16:50:10.2983486Z inflating: build/lib/libonnx.a 2024-11-01T16:50:11.2334136Z inflating: build/lib/libdnnl.a 2024-11-01T16:50:11.2352745Z inflating: build/lib/libfmt.a 2024-11-01T16:50:11.2644562Z inflating: build/lib/libkineto.a 2024-11-01T16:50:11.2733028Z inflating: build/lib/libc10.so 2024-11-01T16:50:11.2734051Z inflating: build/lib/libtorch_global_deps.so 2024-11-01T16:50:11.2776736Z inflating: build/lib/libc10_hip.so 2024-11-01T16:50:11.2777754Z inflating: build/lib/libcaffe2_nvrtc.so 2024-11-01T16:50:13.5017602Z inflating: build/lib/libtorch_cpu.so 2024-11-01T16:50:13.5022257Z inflating: build/lib/libunbox_lib.a 2024-11-01T16:50:13.5025446Z inflating: build/lib/libshm.so 2024-11-01T16:50:15.3929343Z inflating: build/lib/libtorch_hip.so 2024-11-01T16:50:15.3931472Z inflating: build/lib/libtorch.so 2024-11-01T16:50:15.3948779Z inflating: build/lib/libjitbackend_test.so 2024-11-01T16:50:15.3970643Z inflating: build/lib/libbackend_with_compiler.so 2024-11-01T16:50:15.4033248Z inflating: build/lib/libtorchbind_test.so 2024-11-01T16:50:15.4055083Z inflating: 
build/lib/libaoti_custom_ops.so 2024-11-01T16:50:15.5846340Z inflating: build/lib/libtorch_python.so 2024-11-01T16:50:15.5876862Z inflating: build/lib/libnnapi_backend.so 2024-11-01T16:50:15.5877804Z creating: build/bin/ 2024-11-01T16:50:15.5878523Z creating: build/bin/CMakeFiles/ 2024-11-01T16:50:15.5879370Z inflating: build/bin/cmake_install.cmake 2024-11-01T16:50:15.5880263Z inflating: build/bin/CTestTestfile.cmake 2024-11-01T16:50:15.6276635Z inflating: build/bin/protoc-3.13.0.0 2024-11-01T16:50:15.6673180Z inflating: build/bin/protoc 2024-11-01T16:50:15.6719741Z inflating: build/bin/c10_DeviceGuard_test 2024-11-01T16:50:15.6765773Z inflating: build/bin/c10_Device_test 2024-11-01T16:50:15.6810588Z inflating: build/bin/c10_CompileTimeFunctionPointer_test 2024-11-01T16:50:15.6863104Z inflating: build/bin/c10_DispatchKeySet_test 2024-11-01T16:50:15.6910683Z inflating: build/bin/c10_Scalar_test 2024-11-01T16:50:15.6954263Z inflating: build/bin/c10_StreamGuard_test 2024-11-01T16:50:15.7002801Z inflating: build/bin/c10_InlineDeviceGuard_test 2024-11-01T16:50:15.7048047Z inflating: build/bin/c10_SymInt_test 2024-11-01T16:50:15.7098979Z inflating: build/bin/c10_SizesAndStrides_test 2024-11-01T16:50:15.7148067Z inflating: build/bin/c10_InlineStreamGuard_test 2024-11-01T16:50:15.7211107Z inflating: build/bin/c10_cow_test 2024-11-01T16:50:15.7258477Z inflating: build/bin/c10_Bitset_test 2024-11-01T16:50:15.7302058Z inflating: build/bin/c10_ConstexprCrc_test 2024-11-01T16:50:15.7346674Z inflating: build/bin/c10_DeadlockDetection_test 2024-11-01T16:50:15.7391739Z inflating: build/bin/c10_Half_test 2024-11-01T16:50:15.7441982Z inflating: build/bin/c10_LeftRight_test 2024-11-01T16:50:15.7489432Z inflating: build/bin/c10_NetworkFlow_test 2024-11-01T16:50:15.7538756Z inflating: build/bin/c10_Metaprogramming_test 2024-11-01T16:50:15.7583287Z inflating: build/bin/c10_Synchronized_test 2024-11-01T16:50:15.7632904Z inflating: build/bin/c10_ThreadLocal_test 
2024-11-01T16:50:15.7678902Z inflating: build/bin/c10_TypeList_test 2024-11-01T16:50:15.7724284Z inflating: build/bin/c10_TypeIndex_test 2024-11-01T16:50:15.7767883Z inflating: build/bin/c10_TypeTraits_test 2024-11-01T16:50:15.7814147Z inflating: build/bin/c10_accumulate_test 2024-11-01T16:50:15.7863660Z inflating: build/bin/c10_bfloat16_test 2024-11-01T16:50:15.7908564Z inflating: build/bin/c10_bit_cast_test 2024-11-01T16:50:15.7959478Z inflating: build/bin/c10_complex_math_test 2024-11-01T16:50:15.8008949Z inflating: build/bin/c10_complex_test 2024-11-01T16:50:15.8055865Z inflating: build/bin/c10_exception_test 2024-11-01T16:50:15.8100802Z inflating: build/bin/c10_flags_test 2024-11-01T16:50:15.8145457Z inflating: build/bin/c10_generic_math_test 2024-11-01T16:50:15.8190608Z inflating: build/bin/c10_irange_test 2024-11-01T16:50:15.8338760Z inflating: build/bin/c10_intrusive_ptr_test 2024-11-01T16:50:15.8386376Z inflating: build/bin/c10_lazy_test 2024-11-01T16:50:15.8437265Z inflating: build/bin/c10_logging_test 2024-11-01T16:50:15.8504834Z inflating: build/bin/c10_optional_test 2024-11-01T16:50:15.8560550Z inflating: build/bin/c10_ordered_preserving_dict_test 2024-11-01T16:50:15.8608326Z inflating: build/bin/c10_registry_test 2024-11-01T16:50:15.8654295Z inflating: build/bin/c10_ssize_test 2024-11-01T16:50:15.8700747Z inflating: build/bin/c10_string_util_test 2024-11-01T16:50:15.8836569Z inflating: build/bin/c10_small_vector_test 2024-11-01T16:50:15.8889584Z inflating: build/bin/c10_string_view_test 2024-11-01T16:50:15.8934446Z inflating: build/bin/c10_tempfile_test 2024-11-01T16:50:15.8977549Z inflating: build/bin/c10_intrusive_ptr_benchmark 2024-11-01T16:50:15.9027610Z inflating: build/bin/c10_typeid_test 2024-11-01T16:50:15.9071357Z inflating: build/bin/c10_hip_HIPAssertionsTest_1_var_test 2024-11-01T16:50:15.9115249Z inflating: build/bin/c10_hip_HIPAssertionsTest_catches_stream 2024-11-01T16:50:15.9158934Z inflating: 
build/bin/c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2024-11-01T16:50:15.9202310Z inflating: build/bin/c10_hip_HIPAssertionsTest_from_2_processes 2024-11-01T16:50:15.9245980Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2024-11-01T16:50:15.9289422Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 2024-11-01T16:50:15.9332898Z inflating: build/bin/c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2024-11-01T16:50:15.9376337Z inflating: build/bin/c10_hip_HIPTest 2024-11-01T16:50:15.9719173Z inflating: build/bin/vec_test_all_types_DEFAULT 2024-11-01T16:50:16.0071310Z inflating: build/bin/vec_test_all_types_AVX512 2024-11-01T16:50:16.0434983Z inflating: build/bin/vec_test_all_types_AVX2 2024-11-01T16:50:16.0482855Z inflating: build/bin/test_edge_op_registration 2024-11-01T16:50:16.0529515Z inflating: build/bin/BackoffTest 2024-11-01T16:50:16.0576739Z inflating: build/bin/FileStoreTest 2024-11-01T16:50:16.0626469Z inflating: build/bin/TCPStoreTest 2024-11-01T16:50:16.0674350Z inflating: build/bin/HashStoreTest 2024-11-01T16:50:16.0732951Z inflating: build/bin/ProcessGroupGlooTest 2024-11-01T16:50:16.0735237Z inflating: build/bin/example_allreduce 2024-11-01T16:50:16.0739184Z inflating: build/bin/torch_shm_manager 2024-11-01T16:50:16.0792084Z inflating: build/bin/static_runtime_bench 2024-11-01T16:50:16.1017153Z inflating: build/bin/static_runtime_test 2024-11-01T16:50:16.1083041Z inflating: build/bin/Dict_test 2024-11-01T16:50:16.1129328Z inflating: build/bin/Dimname_test 2024-11-01T16:50:16.1187149Z inflating: build/bin/MaybeOwned_test 2024-11-01T16:50:16.1230730Z inflating: build/bin/hip_optional_test 2024-11-01T16:50:16.1282623Z inflating: build/bin/NamedTensor_test 2024-11-01T16:50:16.1335541Z inflating: build/bin/apply_utils_test 2024-11-01T16:50:16.1388140Z inflating: build/bin/atest 2024-11-01T16:50:16.1444836Z inflating: build/bin/basic 
2024-11-01T16:50:16.1493180Z inflating: build/bin/broadcast_test 2024-11-01T16:50:16.1538917Z inflating: build/bin/cpu_allocator_test 2024-11-01T16:50:16.1590876Z inflating: build/bin/cpu_generator_test 2024-11-01T16:50:16.1638630Z inflating: build/bin/cpu_profiling_allocator_test 2024-11-01T16:50:16.1721480Z inflating: build/bin/cpu_rng_test 2024-11-01T16:50:16.1766068Z inflating: build/bin/dispatch_key_set_test 2024-11-01T16:50:16.1811261Z inflating: build/bin/dlconvertor_test 2024-11-01T16:50:16.1863452Z inflating: build/bin/extension_backend_test 2024-11-01T16:50:16.1912639Z inflating: build/bin/half_test 2024-11-01T16:50:16.1997798Z inflating: build/bin/ivalue_test 2024-11-01T16:50:16.2042282Z inflating: build/bin/lazy_tensor_test 2024-11-01T16:50:16.2090674Z inflating: build/bin/math_kernel_test 2024-11-01T16:50:16.2139028Z inflating: build/bin/memory_format_test 2024-11-01T16:50:16.2186308Z inflating: build/bin/memory_overlapping_test 2024-11-01T16:50:16.2233512Z inflating: build/bin/mobile_memory_cleanup 2024-11-01T16:50:16.2278568Z inflating: build/bin/operator_name_test 2024-11-01T16:50:16.2328513Z inflating: build/bin/native_test 2024-11-01T16:50:16.2374434Z inflating: build/bin/operators_test 2024-11-01T16:50:16.2420948Z inflating: build/bin/packedtensoraccessor_test 2024-11-01T16:50:16.2481339Z inflating: build/bin/pow_test 2024-11-01T16:50:16.2532228Z inflating: build/bin/quantized_test 2024-11-01T16:50:16.2576807Z inflating: build/bin/reduce_ops_test 2024-11-01T16:50:16.2622635Z inflating: build/bin/reportMemoryUsage_test 2024-11-01T16:50:16.2673075Z inflating: build/bin/scalar_tensor_test 2024-11-01T16:50:16.2719267Z inflating: build/bin/StorageUtils_test 2024-11-01T16:50:16.2771269Z inflating: build/bin/scalar_test 2024-11-01T16:50:16.2818393Z inflating: build/bin/stride_properties_test 2024-11-01T16:50:16.2889391Z inflating: build/bin/tensor_iterator_test 2024-11-01T16:50:16.2939104Z inflating: build/bin/type_ptr_test 2024-11-01T16:50:16.2987930Z 
inflating: build/bin/test_parallel 2024-11-01T16:50:16.2989600Z inflating: build/bin/thread_init_test 2024-11-01T16:50:16.3043603Z inflating: build/bin/type_test 2024-11-01T16:50:16.3090803Z inflating: build/bin/undefined_tensor_test 2024-11-01T16:50:16.3091849Z inflating: build/bin/verify_api_visibility 2024-11-01T16:50:16.3154340Z inflating: build/bin/legacy_vmap_test 2024-11-01T16:50:16.3200329Z inflating: build/bin/weakref_test 2024-11-01T16:50:16.3246345Z inflating: build/bin/wrapdim_test 2024-11-01T16:50:16.3292878Z inflating: build/bin/xla_tensor_test 2024-11-01T16:50:16.3346554Z inflating: build/bin/IListRef_test 2024-11-01T16:50:16.3442008Z inflating: build/bin/List_test 2024-11-01T16:50:16.3551180Z inflating: build/bin/kernel_function_legacy_test 2024-11-01T16:50:16.3638245Z inflating: build/bin/kernel_function_test 2024-11-01T16:50:16.3697713Z inflating: build/bin/KernelFunction_test 2024-11-01T16:50:16.3812949Z inflating: build/bin/kernel_lambda_legacy_test 2024-11-01T16:50:16.3906406Z inflating: build/bin/kernel_lambda_test 2024-11-01T16:50:16.3961204Z inflating: build/bin/kernel_stackbased_test 2024-11-01T16:50:16.4047631Z inflating: build/bin/make_boxed_from_unboxed_functor_test 2024-11-01T16:50:16.4091070Z inflating: build/bin/op_allowlist_test 2024-11-01T16:50:16.4136679Z inflating: build/bin/CppSignature_test 2024-11-01T16:50:16.4193721Z inflating: build/bin/inline_container_test 2024-11-01T16:50:16.4467072Z inflating: build/bin/op_registration_test 2024-11-01T16:50:16.4516760Z inflating: build/bin/backend_fallback_test 2024-11-01T16:50:16.4560345Z inflating: build/bin/hip_complex_math_test 2024-11-01T16:50:16.4607427Z inflating: build/bin/hip_apply_test 2024-11-01T16:50:16.4651017Z inflating: build/bin/hip_complex_test 2024-11-01T16:50:16.4694597Z inflating: build/bin/hip_distributions_test 2024-11-01T16:50:16.4739317Z inflating: build/bin/hip_generator_test 2024-11-01T16:50:16.4784354Z inflating: build/bin/hip_dlconvertor_test 
2024-11-01T16:50:16.4828043Z inflating: build/bin/hip_half_test 2024-11-01T16:50:16.4871746Z inflating: build/bin/hip_integer_divider_test 2024-11-01T16:50:16.4916017Z inflating: build/bin/hip_packedtensoraccessor_test 2024-11-01T16:50:16.4959483Z inflating: build/bin/hip_vectorized_test 2024-11-01T16:50:16.5474606Z inflating: build/bin/test_jit 2024-11-01T16:50:16.5487523Z inflating: build/bin/tutorial_tensorexpr 2024-11-01T16:50:16.6196983Z inflating: build/bin/test_tensorexpr 2024-11-01T16:50:16.6246070Z inflating: build/bin/test_dist_autograd 2024-11-01T16:50:16.6307459Z inflating: build/bin/test_cpp_rpc 2024-11-01T16:50:16.6308968Z inflating: build/bin/parallel_benchmark 2024-11-01T16:50:16.7358687Z inflating: build/bin/test_api 2024-11-01T16:50:16.7418717Z inflating: build/bin/test_mobile_nnc 2024-11-01T16:50:16.7426699Z inflating: build/bin/aot_model_compiler_test 2024-11-01T16:50:16.7737708Z inflating: build/bin/test_lazy 2024-11-01T16:50:16.7738895Z creating: .additional_ci_files/ 2024-11-01T16:50:16.7808876Z inflating: .additional_ci_files/test-times.json 2024-11-01T16:50:16.8087494Z inflating: .additional_ci_files/test-class-times.json 2024-11-01T16:50:16.8148575Z ##[group]Run rm artifacts.zip 2024-11-01T16:50:16.8149299Z rm artifacts.zip 2024-11-01T16:50:16.8185200Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:16.8186146Z env: 2024-11-01T16:50:16.8186691Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:16.8187467Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:16.8188820Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:16.8190168Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:16.8191010Z AWS_REGION: us-east-1 2024-11-01T16:50:16.8191943Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:16.8193044Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:16.8206155Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:16.8206808Z ##[endgroup] 2024-11-01T16:50:17.0718527Z 
##[group]Run df -H 2024-11-01T16:50:17.0719113Z df -H 2024-11-01T16:50:17.0751324Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:17.0752412Z env: 2024-11-01T16:50:17.0753041Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:17.0753960Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:17.0755634Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:17.0756992Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:17.0757698Z AWS_REGION: us-east-1 2024-11-01T16:50:17.0758449Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:17.0759372Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:17.0771953Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:17.0772596Z ##[endgroup] 2024-11-01T16:50:17.0861110Z Filesystem Size Used Avail Use% Mounted on 2024-11-01T16:50:17.0862458Z tmpfs 14G 18M 14G 1% /run 2024-11-01T16:50:17.0863896Z /dev/mapper/ubuntu--vg-ubuntu--lv 1.9T 361G 1.5T 21% / 2024-11-01T16:50:17.0865496Z tmpfs 68G 8.2k 68G 1% /dev/shm 2024-11-01T16:50:17.0866716Z tmpfs 5.3M 0 5.3M 0% /run/lock 2024-11-01T16:50:17.0867946Z /dev/sda2 2.1G 339M 1.6G 18% /boot 2024-11-01T16:50:17.0869243Z /dev/sda1 1.2G 6.4M 1.2G 1% /boot/efi 2024-11-01T16:50:17.0870766Z tmpfs 14G 17k 14G 1% /run/user/1001 2024-11-01T16:50:17.0933584Z Prepare all required actions 2024-11-01T16:50:17.0934459Z Getting action download info 2024-11-01T16:50:17.6408695Z ##[group]Run ./.github/actions/download-td-artifacts 2024-11-01T16:50:17.6409103Z with: 2024-11-01T16:50:17.6409335Z env: 2024-11-01T16:50:17.6409580Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:17.6409926Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:17.6410527Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:17.6411147Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:17.6411468Z AWS_REGION: us-east-1 2024-11-01T16:50:17.6411837Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:17.6412267Z 
AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:17.6417241Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:17.6417533Z ##[endgroup] 2024-11-01T16:50:17.6447781Z ##[group]Run seemethere/download-artifact-s3@v4 2024-11-01T16:50:17.6448145Z with: 2024-11-01T16:50:17.6448373Z name: td_results 2024-11-01T16:50:17.6448690Z s3-bucket: gha-artifacts 2024-11-01T16:50:17.6448978Z region: us-east-1 2024-11-01T16:50:17.6449219Z env: 2024-11-01T16:50:17.6449451Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:17.6449783Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:17.6450364Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:17.6450913Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:17.6451225Z AWS_REGION: us-east-1 2024-11-01T16:50:17.6451571Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:17.6451968Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:17.6456960Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:17.6457239Z ##[endgroup] 2024-11-01T16:50:18.0673390Z (node:2661127) NOTE: We are formalizing our plans to enter AWS SDK for JavaScript (v2) into maintenance mode in 2023. 2024-11-01T16:50:18.0674800Z 2024-11-01T16:50:18.0675330Z Please migrate your code to use AWS SDK for JavaScript (v3). 
2024-11-01T16:50:18.0676683Z For more information, check the migration guide at https://a.co/7PzMCcy 2024-11-01T16:50:18.0678339Z (Use `node --trace-warnings ...` to show where the warning was created) 2024-11-01T16:50:18.3366895Z Found 1 objects with prefix pytorch/pytorch/11632515612/td_results/ 2024-11-01T16:50:18.3368863Z Starting download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/td_results.json 2024-11-01T16:50:18.6849962Z Finished download (1/1): /home/pytorchci/actions-runner/_work/pytorch/pytorch/td_results.json 2024-11-01T16:50:18.6863026Z Artifact download has finished successfully 2024-11-01T16:50:18.7440255Z ##[group]Run mkdir -p .additional_ci_files 2024-11-01T16:50:18.7441078Z mkdir -p .additional_ci_files 2024-11-01T16:50:18.7442299Z mv td_results.json .additional_ci_files/td_results.json 2024-11-01T16:50:18.7477288Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:18.7477784Z env: 2024-11-01T16:50:18.7478082Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:18.7478522Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:18.7479256Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:18.7479942Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:18.7480392Z AWS_REGION: us-east-1 2024-11-01T16:50:18.7481280Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:18.7482379Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:18.7494605Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:18.7495257Z ##[endgroup] 2024-11-01T16:50:18.7675184Z ##[group]Run .github/scripts/parse_ref.py 2024-11-01T16:50:18.7676121Z .github/scripts/parse_ref.py 2024-11-01T16:50:18.7709320Z shell: /usr/bin/bash -e {0} 2024-11-01T16:50:18.7710002Z env: 2024-11-01T16:50:18.7710532Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:18.7711307Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:18.7712718Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video 
--group-add daemon 2024-11-01T16:50:18.7714023Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:18.7714742Z AWS_REGION: us-east-1 2024-11-01T16:50:18.7715518Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:18.7716450Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:18.7728549Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:18.7729182Z ##[endgroup] 2024-11-01T16:50:18.8052317Z Prepare all required actions 2024-11-01T16:50:18.8127999Z ##[group]Run ./.github/actions/get-workflow-job-id 2024-11-01T16:50:18.8128851Z with: 2024-11-01T16:50:18.8129906Z github-token: *** 2024-11-01T16:50:18.8130536Z env: 2024-11-01T16:50:18.8131060Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:18.8131837Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:18.8133209Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:18.8134524Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:18.8135234Z AWS_REGION: us-east-1 2024-11-01T16:50:18.8135972Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:18.8136899Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:18.8149248Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:18.8149891Z ##[endgroup] 2024-11-01T16:50:18.8185925Z ##[group]Run set -eux 2024-11-01T16:50:18.8186559Z set -eux 2024-11-01T16:50:18.8187670Z python3 .github/scripts/get_workflow_job_id.py "${GITHUB_RUN_ID}" "${RUNNER_NAME}" 2024-11-01T16:50:18.8221234Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:18.8222171Z env: 2024-11-01T16:50:18.8222715Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:18.8223501Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:18.8224873Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:18.8226155Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:18.8226865Z AWS_REGION: us-east-1 2024-11-01T16:50:18.8227623Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:18.8228540Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:18.8241584Z 
AWS_SESSION_TOKEN: *** 2024-11-01T16:50:18.8242615Z GITHUB_TOKEN: *** 2024-11-01T16:50:18.8243316Z ##[endgroup] 2024-11-01T16:50:18.8308045Z + python3 .github/scripts/get_workflow_job_id.py 11632515612 pytorch-rocm-hw-31 2024-11-01T16:50:19.8760155Z setting job-id=32397140314 2024-11-01T16:50:19.8761655Z setting job-name=linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:50:19.9200679Z Prepare all required actions 2024-11-01T16:50:19.9201583Z Getting action download info 2024-11-01T16:50:20.1438890Z ##[group]Run ./.github/actions/filter-test-configs 2024-11-01T16:50:20.1439774Z with: 2024-11-01T16:50:20.1440730Z github-token: *** 2024-11-01T16:50:20.1443342Z test-matrix: {"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "distributed", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]} 2024-11-01T16:50:20.1446057Z job-name: linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:50:20.1447155Z env: 2024-11-01T16:50:20.1447694Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:20.1448475Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:20.1449852Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:20.1451229Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:20.1451957Z AWS_REGION: us-east-1 2024-11-01T16:50:20.1452685Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:20.1453613Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:20.1466402Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:20.1467050Z ##[endgroup] 2024-11-01T16:50:20.1551040Z ##[group]Run nick-fields/retry@v3.0.0 2024-11-01T16:50:20.1551930Z with: 2024-11-01T16:50:20.1552535Z shell: bash 2024-11-01T16:50:20.1553202Z timeout_minutes: 10 2024-11-01T16:50:20.1553931Z max_attempts: 5 2024-11-01T16:50:20.1554536Z retry_wait_seconds: 30 2024-11-01T16:50:20.1556155Z 
command: set -eux # PyYAML 6.0 doesn't work with MacOS x86 anymore # This must run on Python-3.7 (AmazonLinux2) so can't use request=3.32.2 python3 -m pip install requests==2.27.1 pyyaml==6.0.1 2024-11-01T16:50:20.1557744Z polling_interval_seconds: 1 2024-11-01T16:50:20.1558280Z warning_on_retry: true 2024-11-01T16:50:20.1558776Z continue_on_error: false 2024-11-01T16:50:20.1559254Z env: 2024-11-01T16:50:20.1559656Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:20.1560266Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:20.1561487Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:20.1562652Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:20.1563247Z AWS_REGION: us-east-1 2024-11-01T16:50:20.1563815Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:20.1564510Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:20.1573614Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:20.1574276Z GITHUB_TOKEN: *** 2024-11-01T16:50:20.1574725Z ##[endgroup] 2024-11-01T16:50:20.2311651Z + python3 -m pip install requests==2.27.1 pyyaml==6.0.1 2024-11-01T16:50:20.4847206Z Defaulting to user installation because normal site-packages is not writeable 2024-11-01T16:50:20.5613855Z Requirement already satisfied: requests==2.27.1 in /home/pytorchci/.local/lib/python3.10/site-packages (2.27.1) 2024-11-01T16:50:20.5618544Z Requirement already satisfied: pyyaml==6.0.1 in /home/pytorchci/.local/lib/python3.10/site-packages (6.0.1) 2024-11-01T16:50:20.5707662Z Requirement already satisfied: certifi>=2017.4.17 in /usr/lib/python3/dist-packages (from requests==2.27.1) (2020.6.20) 2024-11-01T16:50:20.5715900Z Requirement already satisfied: charset-normalizer~=2.0.0 in /home/pytorchci/.local/lib/python3.10/site-packages (from requests==2.27.1) (2.0.12) 2024-11-01T16:50:20.5725343Z Requirement already satisfied: idna<4,>=2.5 in /usr/lib/python3/dist-packages (from requests==2.27.1) (3.3) 2024-11-01T16:50:20.5729762Z Requirement already satisfied: 
urllib3<1.27,>=1.21.1 in /usr/lib/python3/dist-packages (from requests==2.27.1) (1.26.5) 2024-11-01T16:50:21.2314803Z Command completed after 1 attempt(s). 2024-11-01T16:50:21.2450219Z ##[group]Run set -x 2024-11-01T16:50:21.2450861Z set -x 2024-11-01T16:50:21.2451419Z  2024-11-01T16:50:21.2452437Z # Use relative path here as this could be checked out anywhere, not necessarily 2024-11-01T16:50:21.2454198Z # in runner workspace 2024-11-01T16:50:21.2455205Z python3 "${GITHUB_ACTION_PATH}/../../scripts/parse_ref.py" 2024-11-01T16:50:21.2490519Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:21.2491494Z env: 2024-11-01T16:50:21.2492048Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:21.2492833Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:21.2494219Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:21.2495525Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:21.2496239Z AWS_REGION: us-east-1 2024-11-01T16:50:21.2497016Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:21.2497942Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:21.2510273Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:21.2510911Z ##[endgroup] 2024-11-01T16:50:21.2590525Z + python3 /home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/filter-test-configs/../../scripts/parse_ref.py 2024-11-01T16:50:21.2830048Z ##[group]Run echo "Workflow: ${GITHUB_WORKFLOW}" 2024-11-01T16:50:21.2831042Z echo "Workflow: ${GITHUB_WORKFLOW}" 2024-11-01T16:50:21.2831898Z echo "Job name: ${JOB_NAME}" 2024-11-01T16:50:21.2833005Z  2024-11-01T16:50:21.2833833Z # Use relative path here as this could be checked out anywhere, not necessarily 2024-11-01T16:50:21.2834472Z # in runner workspace 2024-11-01T16:50:21.2835025Z python3 "${GITHUB_ACTION_PATH}/../../scripts/filter_test_configs.py" \ 2024-11-01T16:50:21.2835655Z  --workflow "${GITHUB_WORKFLOW}" \ 2024-11-01T16:50:21.2836100Z  --job-name "${JOB_NAME}" \ 
2024-11-01T16:50:21.2837371Z  --test-matrix "{"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "distributed", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" \ 2024-11-01T16:50:21.2838645Z  --selected-test-configs "" \ 2024-11-01T16:50:21.2839090Z  --pr-number "${PR_NUMBER}" \ 2024-11-01T16:50:21.2839506Z  --tag "${TAG}" \ 2024-11-01T16:50:21.2839903Z  --event-name "${EVENT_NAME}" \ 2024-11-01T16:50:21.2840338Z  --schedule "${SCHEDULE}" \ 2024-11-01T16:50:21.2840764Z  --branch "${HEAD_BRANCH}" 2024-11-01T16:50:21.2874555Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:21.2875052Z env: 2024-11-01T16:50:21.2875346Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:21.2875767Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:21.2876481Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:21.2877153Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:21.2877531Z AWS_REGION: us-east-1 2024-11-01T16:50:21.2877952Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:21.2878439Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:21.2887694Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:21.2888754Z GITHUB_TOKEN: *** 2024-11-01T16:50:21.2889743Z JOB_NAME: linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:50:21.2890855Z PR_NUMBER: 2024-11-01T16:50:21.2891432Z TAG: ciflow/trunk/138766 2024-11-01T16:50:21.2892184Z EVENT_NAME: push 2024-11-01T16:50:21.2892866Z SCHEDULE: 2024-11-01T16:50:21.2893514Z HEAD_BRANCH: 2024-11-01T16:50:21.2894181Z ##[endgroup] 2024-11-01T16:50:21.2948648Z Workflow: trunk 2024-11-01T16:50:21.2950062Z Job name: linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:50:21.5979793Z INFO:root:Found no test-config label on the PR, so all test configs are included 
2024-11-01T16:50:22.1471617Z ##[group]Run echo "Filtered matrix:" 2024-11-01T16:50:22.1472061Z echo "Filtered matrix:" 2024-11-01T16:50:22.1473538Z echo "{"include": [{"config": "default", "shard": 1, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "default", "shard": 2, "num_shards": 2, "runner": "linux.rocm.gpu"}, {"config": "distributed", "shard": 1, "num_shards": 1, "runner": "linux.rocm.gpu"}]}" 2024-11-01T16:50:22.1474729Z  2024-11-01T16:50:22.1475027Z echo 2024-11-01T16:50:22.1475407Z echo "Is the current job unstable? False" 2024-11-01T16:50:22.1475851Z  2024-11-01T16:50:22.1476130Z echo 2024-11-01T16:50:22.1476490Z echo "Is keep-going label set? False" 2024-11-01T16:50:22.1476920Z  2024-11-01T16:50:22.1477199Z echo 2024-11-01T16:50:22.1477521Z echo "Renabled issues? " 2024-11-01T16:50:22.1507187Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:22.1508128Z env: 2024-11-01T16:50:22.1508672Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:22.1509452Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:22.1510830Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:22.1512140Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:22.1512850Z AWS_REGION: us-east-1 2024-11-01T16:50:22.1513734Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:22.1515282Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:22.1523727Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:22.1524073Z ##[endgroup] 2024-11-01T16:50:22.1588911Z Filtered matrix: 2024-11-01T16:50:22.1591352Z {include: [{config: default, shard: 1, num_shards: 2, runner: linux.rocm.gpu}, {config: default, shard: 2, num_shards: 2, runner: linux.rocm.gpu}, {config: distributed, shard: 1, num_shards: 1, runner: linux.rocm.gpu}]} 2024-11-01T16:50:22.1593368Z 2024-11-01T16:50:22.1593760Z Is the current job unstable? False 2024-11-01T16:50:22.1594309Z 2024-11-01T16:50:22.1594811Z Is keep-going label set? 
False 2024-11-01T16:50:22.1595302Z 2024-11-01T16:50:22.1595582Z Renabled issues? 2024-11-01T16:50:22.1671018Z ##[group]Run echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2024-11-01T16:50:22.1672371Z echo "timeout=$((JOB_TIMEOUT-30))" >> "${GITHUB_OUTPUT}" 2024-11-01T16:50:22.1696543Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T16:50:22.1697504Z env: 2024-11-01T16:50:22.1698062Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:22.1698994Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:22.1700395Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:22.1701715Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:22.1702542Z AWS_REGION: us-east-1 2024-11-01T16:50:22.1703437Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:22.1704530Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:22.1715709Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:22.1716061Z JOB_TIMEOUT: 300 2024-11-01T16:50:22.1716382Z ##[endgroup] 2024-11-01T16:50:22.1871905Z ##[group]Run set -x 2024-11-01T16:50:22.1872349Z set -x 2024-11-01T16:50:22.1872671Z  2024-11-01T16:50:22.1873045Z if [[ $TEST_CONFIG == 'multigpu' ]]; then 2024-11-01T16:50:22.1873589Z  TEST_COMMAND=.ci/pytorch/multigpu-test.sh 2024-11-01T16:50:22.1874138Z elif [[ $BUILD_ENVIRONMENT == *onnx* ]]; then 2024-11-01T16:50:22.1874668Z  TEST_COMMAND=.ci/caffe2/test.sh 2024-11-01T16:50:22.1875089Z else 2024-11-01T16:50:22.1875439Z  TEST_COMMAND=.ci/pytorch/test.sh 2024-11-01T16:50:22.1875856Z fi 2024-11-01T16:50:22.1876135Z  2024-11-01T16:50:22.1876599Z # detached container should get cleaned up by teardown_ec2_linux 2024-11-01T16:50:22.1877322Z # TODO: Stop building test binaries as part of the build phase 2024-11-01T16:50:22.1877966Z # Used for GPU_FLAG since that doesn't play nice 2024-11-01T16:50:22.1878538Z # shellcheck disable=SC2086,SC2090 2024-11-01T16:50:22.1879002Z container_name=$(docker run \ 2024-11-01T16:50:22.1879423Z  ${GPU_FLAG:-} \ 
2024-11-01T16:50:22.1879796Z  -e BUILD_ENVIRONMENT \ 2024-11-01T16:50:22.1880198Z  -e PR_NUMBER \ 2024-11-01T16:50:22.1880564Z  -e GITHUB_ACTIONS \ 2024-11-01T16:50:22.1880951Z  -e GITHUB_REPOSITORY \ 2024-11-01T16:50:22.1881358Z  -e GITHUB_WORKFLOW \ 2024-11-01T16:50:22.1881736Z  -e GITHUB_JOB \ 2024-11-01T16:50:22.1882124Z  -e GITHUB_RUN_ID \ 2024-11-01T16:50:22.1882635Z  -e GITHUB_RUN_NUMBER \ 2024-11-01T16:50:22.1883510Z  -e GITHUB_RUN_ATTEMPT \ 2024-11-01T16:50:22.1884367Z  -e JOB_ID \ 2024-11-01T16:50:22.1885091Z  -e JOB_NAME \ 2024-11-01T16:50:22.1885753Z  -e BRANCH \ 2024-11-01T16:50:22.1886396Z  -e SHA1 \ 2024-11-01T16:50:22.1887049Z  -e AWS_DEFAULT_REGION \ 2024-11-01T16:50:22.1887811Z  -e IN_WHEEL_TEST \ 2024-11-01T16:50:22.1888521Z  -e SHARD_NUMBER \ 2024-11-01T16:50:22.1889221Z  -e TEST_CONFIG \ 2024-11-01T16:50:22.1889908Z  -e NUM_TEST_SHARDS \ 2024-11-01T16:50:22.1890637Z  -e REENABLED_ISSUES \ 2024-11-01T16:50:22.1891417Z  -e CONTINUE_THROUGH_ERROR \ 2024-11-01T16:50:22.1892198Z  -e VERBOSE_TEST_LOGS \ 2024-11-01T16:50:22.1893323Z  -e TEST_SHOWLOCALS \ 2024-11-01T16:50:22.1894029Z  -e NO_TEST_TIMEOUT \ 2024-11-01T16:50:22.1894701Z  -e NO_TD \ 2024-11-01T16:50:22.1895419Z  -e MAX_JOBS="$(nproc --ignore=2)" \ 2024-11-01T16:50:22.1896257Z  -e SCCACHE_BUCKET \ 2024-11-01T16:50:22.1897033Z  -e XLA_CLANG_CACHE_S3_BUCKET_NAME \ 2024-11-01T16:50:22.1897960Z  -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK \ 2024-11-01T16:50:22.1899129Z  -e PYTORCH_TEST_RERUN_DISABLED_TESTS \ 2024-11-01T16:50:22.1900004Z  -e TESTS_TO_INCLUDE \ 2024-11-01T16:50:22.1900903Z  --env-file="/tmp/github_env_${GITHUB_RUN_ID}" \ 2024-11-01T16:50:22.1901882Z  --ulimit stack=10485760:83886080 \ 2024-11-01T16:50:22.1902828Z  --ulimit core=0 \ 2024-11-01T16:50:22.1903737Z  --security-opt seccomp=unconfined \ 2024-11-01T16:50:22.1904736Z  --cap-add=SYS_PTRACE \ 2024-11-01T16:50:22.1905596Z  --shm-size="8g" \ 2024-11-01T16:50:22.1906386Z  --tty \ 2024-11-01T16:50:22.1907073Z  --detach \ 
2024-11-01T16:50:22.1907541Z  --name="${container_name}" \ 2024-11-01T16:50:22.1908000Z  --user jenkins \ 2024-11-01T16:50:22.1908473Z  -v "${GITHUB_WORKSPACE}:/var/lib/jenkins/workspace" \ 2024-11-01T16:50:22.1909012Z  -w /var/lib/jenkins/workspace \ 2024-11-01T16:50:22.1909438Z  "${DOCKER_IMAGE}" 2024-11-01T16:50:22.1909788Z ) 2024-11-01T16:50:22.1910129Z # save container name for later step 2024-11-01T16:50:22.1910900Z echo "CONTAINER_NAME=${container_name}" >> "$GITHUB_ENV" 2024-11-01T16:50:22.1911870Z # jenkins user does not have write permission to mounted workspace; work-around by copying within container to jenkins home 2024-11-01T16:50:22.1913124Z docker exec -t "${container_name}" sh -c "cd .. && cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && ${TEST_COMMAND}" 2024-11-01T16:50:22.1938375Z shell: /usr/bin/bash -e {0} 2024-11-01T16:50:22.1939076Z env: 2024-11-01T16:50:22.1939619Z GIT_DEFAULT_BRANCH: main 2024-11-01T16:50:22.1940405Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T16:50:22.1941796Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T16:50:22.1943338Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T16:50:22.1944199Z AWS_REGION: us-east-1 2024-11-01T16:50:22.1945103Z AWS_ACCESS_KEY_ID: *** 2024-11-01T16:50:22.1946182Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T16:50:22.1953023Z AWS_SESSION_TOKEN: *** 2024-11-01T16:50:22.1953440Z BUILD_ENVIRONMENT: linux-focal-rocm6.2-py3.10 2024-11-01T16:50:22.1953889Z PR_NUMBER: 2024-11-01T16:50:22.1954220Z GITHUB_REPOSITORY: pytorch/pytorch 2024-11-01T16:50:22.1954627Z GITHUB_WORKFLOW: trunk 2024-11-01T16:50:22.1954973Z GITHUB_JOB: test 2024-11-01T16:50:22.1955294Z GITHUB_RUN_ID: 11632515612 2024-11-01T16:50:22.1955663Z GITHUB_RUN_NUMBER: 102766 2024-11-01T16:50:22.1956018Z GITHUB_RUN_ATTEMPT: 1 2024-11-01T16:50:22.1956354Z JOB_ID: 32397140314 2024-11-01T16:50:22.1956874Z JOB_NAME: linux-focal-rocm6.2-py3.10 / test (default, 1, 2, 
linux.rocm.gpu) 2024-11-01T16:50:22.1957456Z BRANCH: 2024-11-01T16:50:22.1957802Z SHA1: d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:22.1958265Z CONTINUE_THROUGH_ERROR: False 2024-11-01T16:50:22.1958644Z VERBOSE_TEST_LOGS: False 2024-11-01T16:50:22.1959003Z TEST_SHOWLOCALS: False 2024-11-01T16:50:22.1959344Z NO_TEST_TIMEOUT: False 2024-11-01T16:50:22.1959680Z NO_TD: False 2024-11-01T16:50:22.1959983Z TEST_CONFIG: default 2024-11-01T16:50:22.1960309Z SHARD_NUMBER: 1 2024-11-01T16:50:22.1960622Z NUM_TEST_SHARDS: 2 2024-11-01T16:50:22.1960948Z REENABLED_ISSUES: 2024-11-01T16:50:22.1961346Z SCCACHE_BUCKET: ossci-compiler-cache-circleci-v2 2024-11-01T16:50:22.1962386Z DOCKER_IMAGE: 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:50:22.1965136Z XLA_CLANG_CACHE_S3_BUCKET_NAME: ossci-compiler-clang-cache-circleci-xla 2024-11-01T16:50:22.1966261Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK: 0 2024-11-01T16:50:22.1967065Z PYTORCH_TEST_RERUN_DISABLED_TESTS: 0 2024-11-01T16:50:22.1969124Z TESTS_TO_INCLUDE: test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl 2024-11-01T16:50:22.1971134Z ##[endgroup] 2024-11-01T16:50:22.2019173Z + [[ default == \m\u\l\t\i\g\p\u ]] 2024-11-01T16:50:22.2020274Z + [[ linux-focal-rocm6.2-py3.10 == *onnx* ]] 2024-11-01T16:50:22.2021111Z + TEST_COMMAND=.ci/pytorch/test.sh 2024-11-01T16:50:22.2033020Z +++ nproc --ignore=2 2024-11-01T16:50:22.2065988Z ++ docker run --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon -e BUILD_ENVIRONMENT -e PR_NUMBER -e GITHUB_ACTIONS -e GITHUB_REPOSITORY -e GITHUB_WORKFLOW -e GITHUB_JOB -e GITHUB_RUN_ID -e GITHUB_RUN_NUMBER -e GITHUB_RUN_ATTEMPT -e JOB_ID -e JOB_NAME -e BRANCH -e SHA1 -e AWS_DEFAULT_REGION -e IN_WHEEL_TEST -e SHARD_NUMBER -e TEST_CONFIG -e NUM_TEST_SHARDS 
-e REENABLED_ISSUES -e CONTINUE_THROUGH_ERROR -e VERBOSE_TEST_LOGS -e TEST_SHOWLOCALS -e NO_TEST_TIMEOUT -e NO_TD -e MAX_JOBS=62 -e SCCACHE_BUCKET -e XLA_CLANG_CACHE_S3_BUCKET_NAME -e PYTORCH_TEST_CUDA_MEM_LEAK_CHECK -e PYTORCH_TEST_RERUN_DISABLED_TESTS -e TESTS_TO_INCLUDE --env-file=/tmp/github_env_11632515612 --ulimit stack=10485760:83886080 --ulimit core=0 --security-opt seccomp=unconfined --cap-add=SYS_PTRACE --shm-size=8g --tty --detach --name= --user jenkins -v /home/pytorchci/actions-runner/_work/pytorch/pytorch:/var/lib/jenkins/workspace -w /var/lib/jenkins/workspace 308535385114.dkr.ecr.us-east-1.amazonaws.com/pytorch/pytorch-linux-focal-rocm-n-py3:bdd298b12da59246147f016e0693ffd722419941 2024-11-01T16:50:24.7769619Z + container_name=ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T16:50:24.7770574Z + echo CONTAINER_NAME=ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T16:50:24.7772907Z + docker exec -t ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 sh -c 'cd .. 
&& cp -R workspace pytorch && cd pytorch && pip install dist/*.whl && .ci/pytorch/test.sh' 2024-11-01T16:50:36.3050854Z Processing ./dist/torch-2.6.0a0+gitd1aa4ef-cp310-cp310-linux_x86_64.whl 2024-11-01T16:50:36.9341635Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.6.0a0+gitd1aa4ef) (3.16.1) 2024-11-01T16:50:36.9344355Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.6.0a0+gitd1aa4ef) (4.12.2) 2024-11-01T16:50:36.9345921Z Requirement already satisfied: networkx in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.6.0a0+gitd1aa4ef) (2.8.8) 2024-11-01T16:50:36.9347327Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.6.0a0+gitd1aa4ef) (3.1.4) 2024-11-01T16:50:36.9348775Z Requirement already satisfied: fsspec in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.6.0a0+gitd1aa4ef) (2024.10.0) 2024-11-01T16:50:36.9353662Z Requirement already satisfied: sympy==1.13.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch==2.6.0a0+gitd1aa4ef) (1.13.1) 2024-11-01T16:50:36.9366966Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy==1.13.1->torch==2.6.0a0+gitd1aa4ef) (1.3.0) 2024-11-01T16:50:36.9675146Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from jinja2->torch==2.6.0a0+gitd1aa4ef) (3.0.2) 2024-11-01T16:50:37.2290485Z Installing collected packages: torch 2024-11-01T16:50:48.2851854Z Successfully installed torch-2.6.0a0+gitd1aa4ef 2024-11-01T16:50:48.3394706Z + export TERM=vt100 2024-11-01T16:50:48.3396121Z + TERM=vt100 2024-11-01T16:50:48.3398595Z ++ dirname .ci/pytorch/test.sh 2024-11-01T16:50:48.3410414Z + source .ci/pytorch/common.sh 2024-11-01T16:50:48.3417299Z +++ dirname 
.ci/pytorch/common.sh 2024-11-01T16:50:48.3437331Z ++ source .ci/pytorch/common_utils.sh 2024-11-01T16:50:48.3438434Z +++ declare -f -t trap_add 2024-11-01T16:50:48.3445123Z ++ set -ex 2024-11-01T16:50:48.3446013Z ++ [[ linux-focal-rocm6.2-py3.10 == *rocm* ]] 2024-11-01T16:50:48.3446902Z ++ unset HIP_PLATFORM 2024-11-01T16:50:48.3453209Z ++ export PYTORCH_TEST_WITH_ROCM=1 2024-11-01T16:50:48.3453800Z ++ PYTORCH_TEST_WITH_ROCM=1 2024-11-01T16:50:48.3454231Z ++ export HSAKMT_DEBUG_LEVEL=4 2024-11-01T16:50:48.3454619Z ++ HSAKMT_DEBUG_LEVEL=4 2024-11-01T16:50:48.3455021Z ++ export HSA_FORCE_FINE_GRAIN_PCIE=1 2024-11-01T16:50:48.3458257Z ++ HSA_FORCE_FINE_GRAIN_PCIE=1 2024-11-01T16:50:48.3458627Z ++ BUILD_TEST_LIBTORCH=0 2024-11-01T16:50:48.3459284Z + [[ linux-focal-rocm6.2-py3.10 != *rocm* ]] 2024-11-01T16:50:48.3459686Z + echo 'Environment variables:' 2024-11-01T16:50:48.3460051Z Environment variables: 2024-11-01T16:50:48.3460325Z + env 2024-11-01T16:50:48.3462856Z INSTALLED_DB=yes 2024-11-01T16:50:48.3464886Z GITHUB_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-11-01T16:50:48.3466182Z AOTRITON_INSTALLED_PREFIX=/opt/rocm/aotriton 2024-11-01T16:50:48.3466948Z CONTINUE_THROUGH_ERROR=False 2024-11-01T16:50:48.3467753Z BUILD_ENVIRONMENT=linux-focal-rocm6.2-py3.10 2024-11-01T16:50:48.3468458Z HOSTNAME=ff4898059595 2024-11-01T16:50:48.3470374Z GITHUB_PATH=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/add_path_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3471060Z GITHUB_ACTION=__self 2024-11-01T16:50:48.3471358Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2024-11-01T16:50:48.3471694Z GITHUB_RUN_NUMBER=102766 2024-11-01T16:50:48.3471983Z TEST_CONFIG=default 2024-11-01T16:50:48.3472271Z GITHUB_REPOSITORY_OWNER_ID=21003710 2024-11-01T16:50:48.3472640Z AWS_DEFAULT_REGION=us-east-1 2024-11-01T16:50:48.3473042Z GITHUB_TRIGGERING_ACTOR=pytorch-bot[bot] 2024-11-01T16:50:48.3473393Z GITHUB_REF_TYPE=tag 2024-11-01T16:50:48.3485458Z *** 
2024-11-01T16:50:48.3485746Z GITHUB_REPOSITORY_ID=65600975 2024-11-01T16:50:48.3486077Z GITHUB_ACTIONS=true 2024-11-01T16:50:48.3486442Z SHA1=d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:48.3486873Z GITHUB_SHA=d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:48.3487477Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk.yml@refs/tags/ciflow/trunk/138766 2024-11-01T16:50:48.3488022Z VERBOSE_TEST_LOGS=False 2024-11-01T16:50:48.3488348Z GITHUB_REF=refs/tags/ciflow/trunk/138766 2024-11-01T16:50:48.3488689Z SHARD_NUMBER=1 2024-11-01T16:50:48.3488954Z GITHUB_REF_PROTECTED=false 2024-11-01T16:50:48.3489252Z HOME=/var/lib/jenkins 2024-11-01T16:50:48.3489574Z GITHUB_API_URL=https://api.github.com 2024-11-01T16:50:48.3489947Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2024-11-01T16:50:48.3490322Z LANG=C.UTF-8 2024-11-01T16:50:48.3490577Z PYTORCH_TEST_WITH_ROCM=1 2024-11-01T16:50:48.3490870Z NUM_TEST_SHARDS=2 2024-11-01T16:50:48.3491547Z GITHUB_STATE=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/save_state_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3492389Z JOB_NAME=linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:50:48.3492876Z MAGMA_HOME=/opt/rocm/magma 2024-11-01T16:50:48.3493552Z GITHUB_ENV=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_env_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3494173Z HSAKMT_DEBUG_LEVEL=4 2024-11-01T16:50:48.3494708Z GITHUB_EVENT_PATH=/home/pytorchci/actions-runner/_work/_temp/_github_workflow/event.json 2024-11-01T16:50:48.3495232Z GITHUB_EVENT_NAME=push 2024-11-01T16:50:48.3495516Z GITHUB_RUN_ID=11632515612 2024-11-01T16:50:48.3496245Z GITHUB_STEP_SUMMARY=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/step_summary_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3496959Z GITHUB_ACTOR=pytorch-bot[bot] 2024-11-01T16:50:48.3497522Z PR_NUMBER= 2024-11-01T16:50:48.3497768Z GITHUB_RUN_ATTEMPT=1 
2024-11-01T16:50:48.3498048Z ANACONDA_PYTHON_VERSION=3.10 2024-11-01T16:50:48.3498539Z GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2024-11-01T16:50:48.3498909Z TERM=vt100 2024-11-01T16:50:48.3499150Z INSTALLED_VISION=yes 2024-11-01T16:50:48.3499419Z BRANCH= 2024-11-01T16:50:48.3500195Z TESTS_TO_INCLUDE=test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl 2024-11-01T16:50:48.3501374Z GITHUB_ACTION_PATH=/home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2024-11-01T16:50:48.3501965Z GITHUB_SERVER_URL=https://github.com 2024-11-01T16:50:48.3502311Z PYTORCH_ROCM_ARCH=gfx906;gfx90a 2024-11-01T16:50:48.3502637Z REENABLED_ISSUES= 2024-11-01T16:50:48.3502893Z SHLVL=1 2024-11-01T16:50:48.3503120Z MAX_JOBS=62 2024-11-01T16:50:48.3503370Z GITHUB_ACTOR_ID=54816060 2024-11-01T16:50:48.3503749Z GITHUB_WORKFLOW_SHA=d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:48.3504184Z GITHUB_REF_NAME=ciflow/trunk/138766 2024-11-01T16:50:48.3504511Z ROCM_PATH=/opt/rocm 2024-11-01T16:50:48.3504960Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2024-11-01T16:50:48.3505424Z GITHUB_JOB=test 2024-11-01T16:50:48.3505688Z NO_TEST_TIMEOUT=False 2024-11-01T16:50:48.3505982Z GITHUB_REPOSITORY=pytorch/pytorch 2024-11-01T16:50:48.3506314Z LC_ALL=C.UTF-8 2024-11-01T16:50:48.3506574Z GITHUB_RETENTION_DAYS=90 2024-11-01T16:50:48.3506872Z GITHUB_ACTION_REPOSITORY= 2024-11-01T16:50:48.3507970Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-11-01T16:50:48.3508954Z GITHUB_BASE_REF= 2024-11-01T16:50:48.3509214Z CI=true 2024-11-01T16:50:48.3509460Z HSA_FORCE_FINE_GRAIN_PCIE=1 2024-11-01T16:50:48.3509767Z GITHUB_REPOSITORY_OWNER=pytorch 
2024-11-01T16:50:48.3510084Z JOB_ID=32397140314 2024-11-01T16:50:48.3510344Z INSTALLED_PROTOBUF=yes 2024-11-01T16:50:48.3510624Z GITHUB_HEAD_REF= 2024-11-01T16:50:48.3510879Z GITHUB_ACTION_REF= 2024-11-01T16:50:48.3511238Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2024-11-01T16:50:48.3511617Z TEST_SHOWLOCALS=False 2024-11-01T16:50:48.3511891Z GITHUB_WORKFLOW=trunk 2024-11-01T16:50:48.3512175Z DEBIAN_FRONTEND=noninteractive 2024-11-01T16:50:48.3512904Z GITHUB_OUTPUT=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_output_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3513552Z NO_TD=False 2024-11-01T16:50:48.3513803Z OLDPWD=/var/lib/jenkins 2024-11-01T16:50:48.3514082Z _=/usr/bin/env 2024-11-01T16:50:48.3514484Z ++ python -c 'import site; print(site.getsitepackages()[0])' 2024-11-01T16:50:48.3658335Z + TORCH_INSTALL_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch 2024-11-01T16:50:48.3659774Z + TORCH_BIN_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/bin 2024-11-01T16:50:48.3661197Z + TORCH_LIB_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib 2024-11-01T16:50:48.3662537Z + TORCH_TEST_DIR=/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/test 2024-11-01T16:50:48.3663506Z + BUILD_DIR=build 2024-11-01T16:50:48.3664041Z + BUILD_RENAMED_DIR=build_renamed 2024-11-01T16:50:48.3664760Z + BUILD_BIN_DIR=build/bin 2024-11-01T16:50:48.3665399Z + SHARD_NUMBER=1 2024-11-01T16:50:48.3665977Z + NUM_TEST_SHARDS=2 2024-11-01T16:50:48.3666591Z + export VALGRIND=ON 2024-11-01T16:50:48.3667193Z + VALGRIND=ON 2024-11-01T16:50:48.3667968Z + [[ linux-focal-rocm6.2-py3.10 == *clang9* ]] 2024-11-01T16:50:48.3668782Z + [[ 0 == \1 ]] 2024-11-01T16:50:48.3669335Z + [[ False == \1 ]] 2024-11-01T16:50:48.3670050Z + [[ linux-focal-rocm6.2-py3.10 != *bazel* ]] 2024-11-01T16:50:48.3671635Z ++ realpath build/custom_test_artifacts 2024-11-01T16:50:48.3699119Z + 
CUSTOM_TEST_ARTIFACT_BUILD_DIR=/var/lib/jenkins/pytorch/build/custom_test_artifacts 2024-11-01T16:50:48.3709073Z + [[ -n test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl ]] 2024-11-01T16:50:48.3711014Z + echo 'Setting INCLUDE_CLAUSE' 2024-11-01T16:50:48.3711723Z Setting INCLUDE_CLAUSE 2024-11-01T16:50:48.3713867Z + INCLUDE_CLAUSE='--include test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl' 2024-11-01T16:50:48.3716294Z + echo 'Environment variables' 2024-11-01T16:50:48.3717037Z Environment variables 2024-11-01T16:50:48.3717633Z + env 2024-11-01T16:50:48.3724049Z INSTALLED_DB=yes 2024-11-01T16:50:48.3725397Z GITHUB_WORKSPACE=/home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-11-01T16:50:48.3726608Z AOTRITON_INSTALLED_PREFIX=/opt/rocm/aotriton 2024-11-01T16:50:48.3727469Z CONTINUE_THROUGH_ERROR=False 2024-11-01T16:50:48.3728387Z BUILD_ENVIRONMENT=linux-focal-rocm6.2-py3.10 2024-11-01T16:50:48.3729272Z HOSTNAME=ff4898059595 2024-11-01T16:50:48.3730885Z GITHUB_PATH=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/add_path_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3732447Z GITHUB_ACTION=__self 2024-11-01T16:50:48.3733128Z PYTORCH_TEST_CUDA_MEM_LEAK_CHECK=0 2024-11-01T16:50:48.3733889Z GITHUB_RUN_NUMBER=102766 2024-11-01T16:50:48.3734533Z TEST_CONFIG=default 2024-11-01T16:50:48.3735177Z GITHUB_REPOSITORY_OWNER_ID=21003710 2024-11-01T16:50:48.3736023Z AWS_DEFAULT_REGION=us-east-1 2024-11-01T16:50:48.3737357Z GITHUB_TRIGGERING_ACTOR=pytorch-bot[bot] 2024-11-01T16:50:48.3738398Z GITHUB_REF_TYPE=tag 2024-11-01T16:50:48.3739165Z *** 2024-11-01T16:50:48.3739690Z GITHUB_REPOSITORY_ID=65600975 2024-11-01T16:50:48.3740386Z GITHUB_ACTIONS=true 2024-11-01T16:50:48.3741081Z 
SHA1=d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:48.3742056Z GITHUB_SHA=d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:48.3743466Z GITHUB_WORKFLOW_REF=pytorch/pytorch/.github/workflows/trunk.yml@refs/tags/ciflow/trunk/138766 2024-11-01T16:50:48.3744749Z VERBOSE_TEST_LOGS=False 2024-11-01T16:50:48.3745443Z GITHUB_REF=refs/tags/ciflow/trunk/138766 2024-11-01T16:50:48.3746205Z SHARD_NUMBER=1 2024-11-01T16:50:48.3746796Z GITHUB_REF_PROTECTED=false 2024-11-01T16:50:48.3747449Z HOME=/var/lib/jenkins 2024-11-01T16:50:48.3748141Z GITHUB_API_URL=https://api.github.com 2024-11-01T16:50:48.3749073Z PYTORCH_TEST_RERUN_DISABLED_TESTS=0 2024-11-01T16:50:48.3749858Z LANG=C.UTF-8 2024-11-01T16:50:48.3750430Z PYTORCH_TEST_WITH_ROCM=1 2024-11-01T16:50:48.3751088Z NUM_TEST_SHARDS=2 2024-11-01T16:50:48.3752683Z GITHUB_STATE=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/save_state_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3754707Z JOB_NAME=linux-focal-rocm6.2-py3.10 / test (default, 1, 2, linux.rocm.gpu) 2024-11-01T16:50:48.3755842Z MAGMA_HOME=/opt/rocm/magma 2024-11-01T16:50:48.3757433Z GITHUB_ENV=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_env_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3758913Z HSAKMT_DEBUG_LEVEL=4 2024-11-01T16:50:48.3760134Z GITHUB_EVENT_PATH=/home/pytorchci/actions-runner/_work/_temp/_github_workflow/event.json 2024-11-01T16:50:48.3761350Z GITHUB_EVENT_NAME=push 2024-11-01T16:50:48.3762112Z GITHUB_RUN_ID=11632515612 2024-11-01T16:50:48.3764142Z GITHUB_STEP_SUMMARY=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/step_summary_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3765859Z GITHUB_ACTOR=pytorch-bot[bot] 2024-11-01T16:50:48.3766536Z PR_NUMBER= 2024-11-01T16:50:48.3767075Z GITHUB_RUN_ATTEMPT=1 2024-11-01T16:50:48.3767681Z VALGRIND=ON 2024-11-01T16:50:48.3768248Z ANACONDA_PYTHON_VERSION=3.10 2024-11-01T16:50:48.3769076Z 
GITHUB_GRAPHQL_URL=https://api.github.com/graphql 2024-11-01T16:50:48.3769928Z TERM=vt100 2024-11-01T16:50:48.3770458Z INSTALLED_VISION=yes 2024-11-01T16:50:48.3771054Z BRANCH= 2024-11-01T16:50:48.3772890Z TESTS_TO_INCLUDE=test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl 2024-11-01T16:50:48.3776036Z GITHUB_ACTION_PATH=/home/pytorchci/actions-runner/_work/pytorch/pytorch/./.github/actions/setup-rocm 2024-11-01T16:50:48.3777427Z GITHUB_SERVER_URL=https://github.com 2024-11-01T16:50:48.3778373Z PYTORCH_ROCM_ARCH=gfx906;gfx90a 2024-11-01T16:50:48.3779093Z REENABLED_ISSUES= 2024-11-01T16:50:48.3779660Z SHLVL=1 2024-11-01T16:50:48.3780162Z MAX_JOBS=62 2024-11-01T16:50:48.3780716Z GITHUB_ACTOR_ID=54816060 2024-11-01T16:50:48.3781615Z GITHUB_WORKFLOW_SHA=d1aa4ef5a6c0ee2d57fb0086f5557e34537f0fea 2024-11-01T16:50:48.3782783Z GITHUB_REF_NAME=ciflow/trunk/138766 2024-11-01T16:50:48.3783653Z ROCM_PATH=/opt/rocm 2024-11-01T16:50:48.3784883Z XLA_CLANG_CACHE_S3_BUCKET_NAME=ossci-compiler-clang-cache-circleci-xla 2024-11-01T16:50:48.3786147Z GITHUB_JOB=test 2024-11-01T16:50:48.3786809Z NO_TEST_TIMEOUT=False 2024-11-01T16:50:48.3787472Z GITHUB_REPOSITORY=pytorch/pytorch 2024-11-01T16:50:48.3788225Z LC_ALL=C.UTF-8 2024-11-01T16:50:48.3788801Z GITHUB_RETENTION_DAYS=90 2024-11-01T16:50:48.3789464Z GITHUB_ACTION_REPOSITORY= 2024-11-01T16:50:48.3791796Z PATH=/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-11-01T16:50:48.3794469Z GITHUB_BASE_REF= 2024-11-01T16:50:48.3795018Z CI=true 2024-11-01T16:50:48.3795557Z HSA_FORCE_FINE_GRAIN_PCIE=1 2024-11-01T16:50:48.3796561Z GITHUB_REPOSITORY_OWNER=pytorch 2024-11-01T16:50:48.3797275Z JOB_ID=32397140314 2024-11-01T16:50:48.3797866Z INSTALLED_PROTOBUF=yes 
2024-11-01T16:50:48.3798468Z GITHUB_HEAD_REF= 2024-11-01T16:50:48.3799040Z GITHUB_ACTION_REF= 2024-11-01T16:50:48.3799863Z SCCACHE_BUCKET=ossci-compiler-cache-circleci-v2 2024-11-01T16:50:48.3800733Z TEST_SHOWLOCALS=False 2024-11-01T16:50:48.3801359Z GITHUB_WORKFLOW=trunk 2024-11-01T16:50:48.3802054Z DEBIAN_FRONTEND=noninteractive 2024-11-01T16:50:48.3804079Z GITHUB_OUTPUT=/home/pytorchci/actions-runner/_work/_temp/_runner_file_commands/set_output_a0e50af4-69c2-428e-a0ae-e380509c2690 2024-11-01T16:50:48.3805683Z NO_TD=False 2024-11-01T16:50:48.3806236Z OLDPWD=/var/lib/jenkins 2024-11-01T16:50:48.3806851Z _=/usr/bin/env 2024-11-01T16:50:48.3807485Z + echo 'Testing pytorch' 2024-11-01T16:50:48.3808129Z Testing pytorch 2024-11-01T16:50:48.3808739Z + export LANG=C.UTF-8 2024-11-01T16:50:48.3809373Z + LANG=C.UTF-8 2024-11-01T16:50:48.3809922Z + PR_NUMBER= 2024-11-01T16:50:48.3810514Z + [[ default == \d\e\f\a\u\l\t ]] 2024-11-01T16:50:48.3811275Z + export CUDA_VISIBLE_DEVICES=0 2024-11-01T16:50:48.3811999Z + CUDA_VISIBLE_DEVICES=0 2024-11-01T16:50:48.3812670Z + export HIP_VISIBLE_DEVICES=0 2024-11-01T16:50:48.3813361Z + HIP_VISIBLE_DEVICES=0 2024-11-01T16:50:48.3814053Z + [[ default == \d\i\s\t\r\i\b\u\t\e\d ]] 2024-11-01T16:50:48.3814833Z + [[ default == \s\l\o\w ]] 2024-11-01T16:50:48.3815760Z + [[ linux-focal-rocm6.2-py3.10 == *slow-gradcheck* ]] 2024-11-01T16:50:48.3816849Z + [[ linux-focal-rocm6.2-py3.10 == *cuda* ]] 2024-11-01T16:50:48.3817814Z + [[ linux-focal-rocm6.2-py3.10 == *rocm* ]] 2024-11-01T16:50:48.3818878Z + export PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2024-11-01T16:50:48.3819777Z + PYTORCH_TESTING_DEVICE_ONLY_FOR=cuda 2024-11-01T16:50:48.3820572Z + [[ default == *crossref* ]] 2024-11-01T16:50:48.3821428Z + [[ linux-focal-rocm6.2-py3.10 == *rocm* ]] 2024-11-01T16:50:48.3822379Z + export VALGRIND=OFF 2024-11-01T16:50:48.3823094Z + VALGRIND=OFF 2024-11-01T16:50:48.3823742Z + rocminfo 2024-11-01T16:50:48.3901364Z ROCk module version 6.8.5 is loaded 
2024-11-01T16:50:48.4518426Z ===================== 2024-11-01T16:50:48.4519245Z HSA System Attributes 2024-11-01T16:50:48.4519932Z ===================== 2024-11-01T16:50:48.4520594Z Runtime Version: 1.14 2024-11-01T16:50:48.4521332Z Runtime Ext Version: 1.6 2024-11-01T16:50:48.4522146Z System Timestamp Freq.: 1000.000000MHz 2024-11-01T16:50:48.4524417Z Sig. Max Wait Duration: 18446744073709551615 (0xFFFFFFFFFFFFFFFF) (timestamp count) 2024-11-01T16:50:48.4525836Z Machine Model: LARGE 2024-11-01T16:50:48.4526919Z System Endianness: LITTLE 2024-11-01T16:50:48.4527876Z Mwaitx: DISABLED 2024-11-01T16:50:48.4528609Z DMAbuf Support: YES 2024-11-01T16:50:48.4529091Z 2024-11-01T16:50:48.4529342Z ========== 2024-11-01T16:50:48.4529995Z HSA Agents 2024-11-01T16:50:48.4530638Z ========== 2024-11-01T16:50:48.4531239Z ******* 2024-11-01T16:50:48.4531855Z Agent 1 2024-11-01T16:50:48.4532455Z ******* 2024-11-01T16:50:48.4533492Z Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:50:48.4534645Z Uuid: CPU-XX 2024-11-01T16:50:48.4535819Z Marketing Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:50:48.4536931Z Vendor Name: CPU 2024-11-01T16:50:48.4538629Z Feature: None specified 2024-11-01T16:50:48.4539883Z Profile: FULL_PROFILE 2024-11-01T16:50:48.4541113Z Float Round Mode: NEAR 2024-11-01T16:50:48.4542372Z Max Queue Number: 0(0x0) 2024-11-01T16:50:48.4543602Z Queue Min Size: 0(0x0) 2024-11-01T16:50:48.4544984Z Queue Max Size: 0(0x0) 2024-11-01T16:50:48.4546001Z Queue Type: MULTI 2024-11-01T16:50:48.4546947Z Node: 0 2024-11-01T16:50:48.4547893Z Device Type: CPU 2024-11-01T16:50:48.4548773Z Cache Info: 2024-11-01T16:50:48.4549824Z L1: 32768(0x8000) KB 2024-11-01T16:50:48.4550818Z Chip ID: 0(0x0) 2024-11-01T16:50:48.4551825Z ASIC Revision: 0(0x0) 2024-11-01T16:50:48.4552866Z Cacheline Size: 64(0x40) 2024-11-01T16:50:48.4553912Z Max Clock Freq. 
(MHz): 2600 2024-11-01T16:50:48.4554896Z BDFID: 0 2024-11-01T16:50:48.4555889Z Internal Node ID: 0 2024-11-01T16:50:48.4556898Z Compute Unit: 32 2024-11-01T16:50:48.4557877Z SIMDs per CU: 0 2024-11-01T16:50:48.4558908Z Shader Engines: 0 2024-11-01T16:50:48.4560166Z Shader Arrs. per Eng.: 0 2024-11-01T16:50:48.4561486Z WatchPts on Addr. Ranges:1 2024-11-01T16:50:48.4562480Z Memory Properties: 2024-11-01T16:50:48.4563167Z Features: None 2024-11-01T16:50:48.4563871Z Pool Info: 2024-11-01T16:50:48.4564529Z Pool 1 2024-11-01T16:50:48.4565400Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-11-01T16:50:48.4566459Z Size: 65790800(0x3ebe350) KB 2024-11-01T16:50:48.4567505Z Allocatable: TRUE 2024-11-01T16:50:48.4568584Z Alloc Granule: 4KB 2024-11-01T16:50:48.4569719Z Alloc Recommended Granule:4KB 2024-11-01T16:50:48.4570845Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4571933Z Accessible by all: TRUE 2024-11-01T16:50:48.4572833Z Pool 2 2024-11-01T16:50:48.4574124Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-11-01T16:50:48.4575213Z Size: 65790800(0x3ebe350) KB 2024-11-01T16:50:48.4576262Z Allocatable: TRUE 2024-11-01T16:50:48.4577327Z Alloc Granule: 4KB 2024-11-01T16:50:48.4578610Z Alloc Recommended Granule:4KB 2024-11-01T16:50:48.4579721Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4580819Z Accessible by all: TRUE 2024-11-01T16:50:48.4581746Z Pool 3 2024-11-01T16:50:48.4582749Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:50:48.4583984Z Size: 65790800(0x3ebe350) KB 2024-11-01T16:50:48.4585205Z Allocatable: TRUE 2024-11-01T16:50:48.4586454Z Alloc Granule: 4KB 2024-11-01T16:50:48.4587751Z Alloc Recommended Granule:4KB 2024-11-01T16:50:48.4588857Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4589922Z Accessible by all: TRUE 2024-11-01T16:50:48.4590826Z ISA Info: 2024-11-01T16:50:48.4591492Z ******* 2024-11-01T16:50:48.4592155Z Agent 2 2024-11-01T16:50:48.4592784Z ******* 2024-11-01T16:50:48.4594043Z Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:50:48.4595207Z Uuid: CPU-XX 
2024-11-01T16:50:48.4596366Z Marketing Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:50:48.4597426Z Vendor Name: CPU 2024-11-01T16:50:48.4598438Z Feature: None specified 2024-11-01T16:50:48.4599441Z Profile: FULL_PROFILE 2024-11-01T16:50:48.4600469Z Float Round Mode: NEAR 2024-11-01T16:50:48.4601509Z Max Queue Number: 0(0x0) 2024-11-01T16:50:48.4602673Z Queue Min Size: 0(0x0) 2024-11-01T16:50:48.4603856Z Queue Max Size: 0(0x0) 2024-11-01T16:50:48.4605006Z Queue Type: MULTI 2024-11-01T16:50:48.4605965Z Node: 1 2024-11-01T16:50:48.4606950Z Device Type: CPU 2024-11-01T16:50:48.4607853Z Cache Info: 2024-11-01T16:50:48.4608644Z L1: 32768(0x8000) KB 2024-11-01T16:50:48.4609623Z Chip ID: 0(0x0) 2024-11-01T16:50:48.4610622Z ASIC Revision: 0(0x0) 2024-11-01T16:50:48.4611639Z Cacheline Size: 64(0x40) 2024-11-01T16:50:48.4612655Z Max Clock Freq. (MHz): 2600 2024-11-01T16:50:48.4613644Z BDFID: 0 2024-11-01T16:50:48.4614619Z Internal Node ID: 1 2024-11-01T16:50:48.4615618Z Compute Unit: 32 2024-11-01T16:50:48.4616607Z SIMDs per CU: 0 2024-11-01T16:50:48.4617605Z Shader Engines: 0 2024-11-01T16:50:48.4618779Z Shader Arrs. per Eng.: 0 2024-11-01T16:50:48.4619848Z WatchPts on Addr. 
Ranges:1 2024-11-01T16:50:48.4620766Z Memory Properties: 2024-11-01T16:50:48.4621456Z Features: None 2024-11-01T16:50:48.4622578Z Pool Info: 2024-11-01T16:50:48.4623358Z Pool 1 2024-11-01T16:50:48.4624375Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-11-01T16:50:48.4625620Z Size: 66046476(0x3efca0c) KB 2024-11-01T16:50:48.4626860Z Allocatable: TRUE 2024-11-01T16:50:48.4628056Z Alloc Granule: 4KB 2024-11-01T16:50:48.4629173Z Alloc Recommended Granule:4KB 2024-11-01T16:50:48.4630317Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4631392Z Accessible by all: TRUE 2024-11-01T16:50:48.4632300Z Pool 2 2024-11-01T16:50:48.4633161Z Segment: GLOBAL; FLAGS: KERNARG, FINE GRAINED 2024-11-01T16:50:48.4634187Z Size: 66046476(0x3efca0c) KB 2024-11-01T16:50:48.4635210Z Allocatable: TRUE 2024-11-01T16:50:48.4636246Z Alloc Granule: 4KB 2024-11-01T16:50:48.4637334Z Alloc Recommended Granule:4KB 2024-11-01T16:50:48.4638419Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4639472Z Accessible by all: TRUE 2024-11-01T16:50:48.4640371Z Pool 3 2024-11-01T16:50:48.4641489Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:50:48.4642747Z Size: 66046476(0x3efca0c) KB 2024-11-01T16:50:48.4643920Z Allocatable: TRUE 2024-11-01T16:50:48.4645099Z Alloc Granule: 4KB 2024-11-01T16:50:48.4646182Z Alloc Recommended Granule:4KB 2024-11-01T16:50:48.4647280Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4648335Z Accessible by all: TRUE 2024-11-01T16:50:48.4649219Z ISA Info: 2024-11-01T16:50:48.4649868Z ******* 2024-11-01T16:50:48.4650487Z Agent 3 2024-11-01T16:50:48.4651113Z ******* 2024-11-01T16:50:48.4651854Z Name: gfx90a 2024-11-01T16:50:48.4652995Z Uuid: GPU-cc050612e1017d6b 2024-11-01T16:50:48.4654040Z Marketing Name: AMD Instinct MI210 2024-11-01T16:50:48.4655073Z Vendor Name: AMD 2024-11-01T16:50:48.4656085Z Feature: KERNEL_DISPATCH 2024-11-01T16:50:48.4657078Z Profile: BASE_PROFILE 2024-11-01T16:50:48.4658265Z Float Round Mode: NEAR 2024-11-01T16:50:48.4659311Z Max Queue Number: 128(0x80) 
2024-11-01T16:50:48.4660338Z Queue Min Size: 64(0x40) 2024-11-01T16:50:48.4661346Z Queue Max Size: 131072(0x20000) 2024-11-01T16:50:48.4662424Z Queue Type: MULTI 2024-11-01T16:50:48.4663536Z Node: 2 2024-11-01T16:50:48.4664667Z Device Type: GPU 2024-11-01T16:50:48.4665684Z Cache Info: 2024-11-01T16:50:48.4666607Z L1: 16(0x10) KB 2024-11-01T16:50:48.4667728Z L2: 8192(0x2000) KB 2024-11-01T16:50:48.4668722Z Chip ID: 29711(0x740f) 2024-11-01T16:50:48.4669734Z ASIC Revision: 1(0x1) 2024-11-01T16:50:48.4671093Z Cacheline Size: 64(0x40) 2024-11-01T16:50:48.4672119Z Max Clock Freq. (MHz): 1700 2024-11-01T16:50:48.4673103Z BDFID: 768 2024-11-01T16:50:48.4674082Z Internal Node ID: 2 2024-11-01T16:50:48.4675104Z Compute Unit: 104 2024-11-01T16:50:48.4676102Z SIMDs per CU: 4 2024-11-01T16:50:48.4677111Z Shader Engines: 8 2024-11-01T16:50:48.4678142Z Shader Arrs. per Eng.: 1 2024-11-01T16:50:48.4679201Z WatchPts on Addr. Ranges:4 2024-11-01T16:50:48.4680276Z Coherent Host Access: FALSE 2024-11-01T16:50:48.4681188Z Memory Properties: 2024-11-01T16:50:48.4681946Z Features: KERNEL_DISPATCH 2024-11-01T16:50:48.4682935Z Fast F16 Operation: TRUE 2024-11-01T16:50:48.4683994Z Wavefront Size: 64(0x40) 2024-11-01T16:50:48.4685040Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:50:48.4685982Z Workgroup Max Size per Dimension: 2024-11-01T16:50:48.4686848Z x 1024(0x400) 2024-11-01T16:50:48.4687770Z y 1024(0x400) 2024-11-01T16:50:48.4688957Z z 1024(0x400) 2024-11-01T16:50:48.4689951Z Max Waves Per CU: 32(0x20) 2024-11-01T16:50:48.4691149Z Max Work-item Per CU: 2048(0x800) 2024-11-01T16:50:48.4692222Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:50:48.4693145Z Grid Max Size per Dimension: 2024-11-01T16:50:48.4693959Z x 4294967295(0xffffffff) 2024-11-01T16:50:48.4694871Z y 4294967295(0xffffffff) 2024-11-01T16:50:48.4695771Z z 4294967295(0xffffffff) 2024-11-01T16:50:48.4696807Z Max fbarriers/Workgrp: 32 2024-11-01T16:50:48.4697928Z Packet Processor uCode:: 83 2024-11-01T16:50:48.4699206Z 
SDMA engine uCode:: 8 2024-11-01T16:50:48.4700275Z IOMMU Support:: None 2024-11-01T16:50:48.4701164Z Pool Info: 2024-11-01T16:50:48.4701831Z Pool 1 2024-11-01T16:50:48.4702711Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:50:48.4703767Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:50:48.4704807Z Allocatable: TRUE 2024-11-01T16:50:48.4705891Z Alloc Granule: 4KB 2024-11-01T16:50:48.4707041Z Alloc Recommended Granule:2048KB 2024-11-01T16:50:48.4708181Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4709288Z Accessible by all: FALSE 2024-11-01T16:50:48.4710215Z Pool 2 2024-11-01T16:50:48.4711076Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-11-01T16:50:48.4712131Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:50:48.4713149Z Allocatable: TRUE 2024-11-01T16:50:48.4714208Z Alloc Granule: 4KB 2024-11-01T16:50:48.4715302Z Alloc Recommended Granule:2048KB 2024-11-01T16:50:48.4716404Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4717842Z Accessible by all: FALSE 2024-11-01T16:50:48.4718765Z Pool 3 2024-11-01T16:50:48.4719598Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-11-01T16:50:48.4720638Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:50:48.4721651Z Allocatable: TRUE 2024-11-01T16:50:48.4722715Z Alloc Granule: 4KB 2024-11-01T16:50:48.4723819Z Alloc Recommended Granule:2048KB 2024-11-01T16:50:48.4724944Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4726013Z Accessible by all: FALSE 2024-11-01T16:50:48.4726922Z Pool 4 2024-11-01T16:50:48.4727722Z Segment: GROUP 2024-11-01T16:50:48.4728715Z Size: 64(0x40) KB 2024-11-01T16:50:48.4729727Z Allocatable: FALSE 2024-11-01T16:50:48.4730759Z Alloc Granule: 0KB 2024-11-01T16:50:48.4731856Z Alloc Recommended Granule:0KB 2024-11-01T16:50:48.4732960Z Alloc Alignment: 0KB 2024-11-01T16:50:48.4734019Z Accessible by all: FALSE 2024-11-01T16:50:48.4734913Z ISA Info: 2024-11-01T16:50:48.4735874Z ISA 1 2024-11-01T16:50:48.4736924Z Name: amdgcn-amd-amdhsa--gfx90a:sramecc+:xnack- 2024-11-01T16:50:48.4738074Z Machine Models: HSA_MACHINE_MODEL_LARGE 
2024-11-01T16:50:48.4739309Z Profiles: HSA_PROFILE_BASE 2024-11-01T16:50:48.4740397Z Default Rounding Mode: NEAR 2024-11-01T16:50:48.4741518Z Default Rounding Mode: NEAR 2024-11-01T16:50:48.4742582Z Fast f16: TRUE 2024-11-01T16:50:48.4743647Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:50:48.4744635Z Workgroup Max Size per Dimension: 2024-11-01T16:50:48.4745534Z x 1024(0x400) 2024-11-01T16:50:48.4746495Z y 1024(0x400) 2024-11-01T16:50:48.4747451Z z 1024(0x400) 2024-11-01T16:50:48.4748464Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:50:48.4749414Z Grid Max Size per Dimension: 2024-11-01T16:50:48.4750247Z x 4294967295(0xffffffff) 2024-11-01T16:50:48.4751179Z y 4294967295(0xffffffff) 2024-11-01T16:50:48.4752135Z z 4294967295(0xffffffff) 2024-11-01T16:50:48.4753157Z FBarrier Max Size: 32 2024-11-01T16:50:48.4754056Z ******* 2024-11-01T16:50:48.4754680Z Agent 4 2024-11-01T16:50:48.4755300Z ******* 2024-11-01T16:50:48.4756043Z Name: gfx90a 2024-11-01T16:50:48.4757174Z Uuid: GPU-f7f1beca7bad9e24 2024-11-01T16:50:48.4758243Z Marketing Name: AMD Instinct MI210 2024-11-01T16:50:48.4759270Z Vendor Name: AMD 2024-11-01T16:50:48.4760301Z Feature: KERNEL_DISPATCH 2024-11-01T16:50:48.4761361Z Profile: BASE_PROFILE 2024-11-01T16:50:48.4762468Z Float Round Mode: NEAR 2024-11-01T16:50:48.4763902Z Max Queue Number: 128(0x80) 2024-11-01T16:50:48.4764952Z Queue Min Size: 64(0x40) 2024-11-01T16:50:48.4765993Z Queue Max Size: 131072(0x20000) 2024-11-01T16:50:48.4767006Z Queue Type: MULTI 2024-11-01T16:50:48.4767971Z Node: 3 2024-11-01T16:50:48.4768941Z Device Type: GPU 2024-11-01T16:50:48.4769830Z Cache Info: 2024-11-01T16:50:48.4770630Z L1: 16(0x10) KB 2024-11-01T16:50:48.4771586Z L2: 8192(0x2000) KB 2024-11-01T16:50:48.4772575Z Chip ID: 29711(0x740f) 2024-11-01T16:50:48.4773596Z ASIC Revision: 1(0x1) 2024-11-01T16:50:48.4774642Z Cacheline Size: 64(0x40) 2024-11-01T16:50:48.4775702Z Max Clock Freq. 
(MHz): 1700 2024-11-01T16:50:48.4776696Z BDFID: 33536 2024-11-01T16:50:48.4777689Z Internal Node ID: 3 2024-11-01T16:50:48.4778857Z Compute Unit: 104 2024-11-01T16:50:48.4779846Z SIMDs per CU: 4 2024-11-01T16:50:48.4780854Z Shader Engines: 8 2024-11-01T16:50:48.4782226Z Shader Arrs. per Eng.: 1 2024-11-01T16:50:48.4783347Z WatchPts on Addr. Ranges:4 2024-11-01T16:50:48.4784422Z Coherent Host Access: FALSE 2024-11-01T16:50:48.4785344Z Memory Properties: 2024-11-01T16:50:48.4786101Z Features: KERNEL_DISPATCH 2024-11-01T16:50:48.4787085Z Fast F16 Operation: TRUE 2024-11-01T16:50:48.4788126Z Wavefront Size: 64(0x40) 2024-11-01T16:50:48.4789163Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:50:48.4790089Z Workgroup Max Size per Dimension: 2024-11-01T16:50:48.4790946Z x 1024(0x400) 2024-11-01T16:50:48.4791864Z y 1024(0x400) 2024-11-01T16:50:48.4792768Z z 1024(0x400) 2024-11-01T16:50:48.4793767Z Max Waves Per CU: 32(0x20) 2024-11-01T16:50:48.4794977Z Max Work-item Per CU: 2048(0x800) 2024-11-01T16:50:48.4796051Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:50:48.4796964Z Grid Max Size per Dimension: 2024-11-01T16:50:48.4797741Z x 4294967295(0xffffffff) 2024-11-01T16:50:48.4798677Z y 4294967295(0xffffffff) 2024-11-01T16:50:48.4799642Z z 4294967295(0xffffffff) 2024-11-01T16:50:48.4800669Z Max fbarriers/Workgrp: 32 2024-11-01T16:50:48.4801767Z Packet Processor uCode:: 83 2024-11-01T16:50:48.4802864Z SDMA engine uCode:: 8 2024-11-01T16:50:48.4803895Z IOMMU Support:: None 2024-11-01T16:50:48.4804766Z Pool Info: 2024-11-01T16:50:48.4805452Z Pool 1 2024-11-01T16:50:48.4806323Z Segment: GLOBAL; FLAGS: COARSE GRAINED 2024-11-01T16:50:48.4807392Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:50:48.4808437Z Allocatable: TRUE 2024-11-01T16:50:48.4809507Z Alloc Granule: 4KB 2024-11-01T16:50:48.4810959Z Alloc Recommended Granule:2048KB 2024-11-01T16:50:48.4812086Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4813192Z Accessible by all: FALSE 2024-11-01T16:50:48.4814125Z Pool 2 
2024-11-01T16:50:48.4814995Z Segment: GLOBAL; FLAGS: EXTENDED FINE GRAINED 2024-11-01T16:50:48.4816041Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:50:48.4817072Z Allocatable: TRUE 2024-11-01T16:50:48.4818289Z Alloc Granule: 4KB 2024-11-01T16:50:48.4819395Z Alloc Recommended Granule:2048KB 2024-11-01T16:50:48.4820494Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4821557Z Accessible by all: FALSE 2024-11-01T16:50:48.4822471Z Pool 3 2024-11-01T16:50:48.4823309Z Segment: GLOBAL; FLAGS: FINE GRAINED 2024-11-01T16:50:48.4824334Z Size: 67092480(0x3ffc000) KB 2024-11-01T16:50:48.4825376Z Allocatable: TRUE 2024-11-01T16:50:48.4826411Z Alloc Granule: 4KB 2024-11-01T16:50:48.4827505Z Alloc Recommended Granule:2048KB 2024-11-01T16:50:48.4828920Z Alloc Alignment: 4KB 2024-11-01T16:50:48.4830003Z Accessible by all: FALSE 2024-11-01T16:50:48.4830916Z Pool 4 2024-11-01T16:50:48.4831716Z Segment: GROUP 2024-11-01T16:50:48.4832709Z Size: 64(0x40) KB 2024-11-01T16:50:48.4833731Z Allocatable: FALSE 2024-11-01T16:50:48.4834763Z Alloc Granule: 0KB 2024-11-01T16:50:48.4835851Z Alloc Recommended Granule:0KB 2024-11-01T16:50:48.4836963Z Alloc Alignment: 0KB 2024-11-01T16:50:48.4838022Z Accessible by all: FALSE 2024-11-01T16:50:48.4838926Z ISA Info: 2024-11-01T16:50:48.4839588Z ISA 1 2024-11-01T16:50:48.4840620Z Name: amdgcn-amd-amdhsa--gfx90a:sramecc+:xnack- 2024-11-01T16:50:48.4841770Z Machine Models: HSA_MACHINE_MODEL_LARGE 2024-11-01T16:50:48.4842863Z Profiles: HSA_PROFILE_BASE 2024-11-01T16:50:48.4843950Z Default Rounding Mode: NEAR 2024-11-01T16:50:48.4845062Z Default Rounding Mode: NEAR 2024-11-01T16:50:48.4846123Z Fast f16: TRUE 2024-11-01T16:50:48.4847177Z Workgroup Max Size: 1024(0x400) 2024-11-01T16:50:48.4848180Z Workgroup Max Size per Dimension: 2024-11-01T16:50:48.4849070Z x 1024(0x400) 2024-11-01T16:50:48.4850026Z y 1024(0x400) 2024-11-01T16:50:48.4850959Z z 1024(0x400) 2024-11-01T16:50:48.4851994Z Grid Max Size: 4294967295(0xffffffff) 2024-11-01T16:50:48.4852936Z Grid Max Size 
per Dimension: 2024-11-01T16:50:48.4853754Z x 4294967295(0xffffffff) 2024-11-01T16:50:48.4854705Z y 4294967295(0xffffffff) 2024-11-01T16:50:48.4855647Z z 4294967295(0xffffffff) 2024-11-01T16:50:48.4857000Z FBarrier Max Size: 32 2024-11-01T16:50:48.4857913Z *** Done *** 2024-11-01T16:50:48.4858736Z + rocminfo 2024-11-01T16:50:48.4859403Z + grep -E 'Name:.*\sgfx|Marketing' 2024-11-01T16:50:48.5352364Z Marketing Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:50:48.5353798Z Marketing Name: AMD EPYC 7513 32-Core Processor 2024-11-01T16:50:48.5354872Z Name: gfx90a 2024-11-01T16:50:48.5355922Z Marketing Name: AMD Instinct MI210 2024-11-01T16:50:48.5356779Z Name: gfx90a 2024-11-01T16:50:48.5357332Z Marketing Name: AMD Instinct MI210 2024-11-01T16:50:48.5555967Z + [[ linux-focal-rocm6.2-py3.10 == *xpu* ]] 2024-11-01T16:50:48.5557220Z + [[ linux-focal-rocm6.2-py3.10 != *-bazel-* ]] 2024-11-01T16:50:48.5558172Z + pip_install --user ninja==1.10.2 2024-11-01T16:50:48.5559179Z + pip install --progress-bar off --user ninja==1.10.2 2024-11-01T16:50:48.8889954Z Collecting ninja==1.10.2 2024-11-01T16:50:48.9120701Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl.metadata (5.0 kB) 2024-11-01T16:50:48.9231366Z Downloading ninja-1.10.2-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB) 2024-11-01T16:50:49.1799620Z Installing collected packages: ninja 2024-11-01T16:50:49.1868497Z  WARNING: The script ninja is installed in '/var/lib/jenkins/.local/bin' which is not on PATH. 2024-11-01T16:50:49.1871516Z Consider adding this directory to PATH or, if you prefer to suppress this warning, use --no-warn-script-location. 
2024-11-01T16:50:49.1898551Z Successfully installed ninja-1.10.2 2024-11-01T16:50:49.2511952Z + export PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-11-01T16:50:49.2516986Z + PATH=/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-11-01T16:50:49.2519934Z + [[ linux-focal-rocm6.2-py3.10 == *aarch64* ]] 2024-11-01T16:50:49.2520786Z + install_tlparse 2024-11-01T16:50:49.2521509Z + pip_install --user tlparse==0.3.25 2024-11-01T16:50:49.2522555Z + pip install --progress-bar off --user tlparse==0.3.25 2024-11-01T16:50:49.5934463Z Collecting tlparse==0.3.25 2024-11-01T16:50:49.6148319Z Downloading tlparse-0.3.25-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (1.7 kB) 2024-11-01T16:50:49.6237589Z Downloading tlparse-0.3.25-py3-none-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (2.2 MB) 2024-11-01T16:50:49.9632152Z Installing collected packages: tlparse 2024-11-01T16:50:49.9934407Z Successfully installed tlparse-0.3.25 2024-11-01T16:50:50.0599399Z ++ python -m site --user-base 2024-11-01T16:50:50.0833508Z + PATH=/var/lib/jenkins/.local/bin:/var/lib/jenkins/.local/bin:/opt/cache/bin:/opt/rocm/llvm/bin:/opt/rocm/opencl/bin:/opt/rocm/hip/bin:/opt/rocm/hcc/bin:/opt/rocm/bin:/opt/conda/envs/py_3.10/bin:/opt/conda/bin:/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin 2024-11-01T16:50:50.0836858Z + [[ linux-focal-rocm6.2-py3.10 == *asan* ]] 2024-11-01T16:50:50.0837904Z + [[ linux-focal-rocm6.2-py3.10 == *-debug* ]] 2024-11-01T16:50:50.0838935Z + [[ linux-focal-rocm6.2-py3.10 != *-bazel-* ]] 2024-11-01T16:50:50.0840449Z + echo 'We are not in debug mode: 
linux-focal-rocm6.2-py3.10. Expect the assertion to pass' 2024-11-01T16:50:50.0842247Z We are not in debug mode: linux-focal-rocm6.2-py3.10. Expect the assertion to pass 2024-11-01T16:50:50.0843629Z + cd test 2024-11-01T16:50:50.0844821Z + python -c 'import torch; torch._C._crash_if_debug_asserts_fail(424242)' 2024-11-01T16:50:52.9773780Z + [[ default == \n\o\g\p\u\_\N\O\_\A\V\X\2 ]] 2024-11-01T16:50:52.9774802Z + [[ default == \n\o\g\p\u\_\A\V\X\5\1\2 ]] 2024-11-01T16:50:52.9778771Z + DYNAMO_BENCHMARK_FLAGS=() 2024-11-01T16:50:52.9779604Z + [[ default == *pr_time_benchmarks* ]] 2024-11-01T16:50:52.9780422Z + [[ default == *dynamo_eager* ]] 2024-11-01T16:50:52.9781246Z + [[ default == *aot_eager* ]] 2024-11-01T16:50:52.9781994Z + [[ default == *aot_inductor* ]] 2024-11-01T16:50:52.9782750Z + [[ default == *inductor* ]] 2024-11-01T16:50:52.9783460Z + [[ default == *dynamic* ]] 2024-11-01T16:50:52.9784165Z + [[ default == *cpu* ]] 2024-11-01T16:50:52.9785144Z + DYNAMO_BENCHMARK_FLAGS+=(--device cuda) 2024-11-01T16:50:52.9824968Z + [[ linux-focal-rocm6.2-py3.10 == *libtorch* ]] 2024-11-01T16:50:52.9826203Z + [[ linux-focal-rocm6.2-py3.10 == *-bazel-* ]] 2024-11-01T16:50:52.9830622Z + cd test 2024-11-01T16:50:52.9831847Z + python -c 'import torch; print(torch.__config__.show())' 2024-11-01T16:50:55.2985274Z PyTorch built with: 2024-11-01T16:50:55.2986273Z - GCC 9.4 2024-11-01T16:50:55.2986944Z - C++ Version: 201703 2024-11-01T16:50:55.2988558Z - Intel(R) oneAPI Math Kernel Library Version 2021.4-Product Build 20210904 for Intel(R) 64 architecture applications 2024-11-01T16:50:55.2990603Z - Intel(R) MKL-DNN v3.5.3 (Git Hash 66f0cb9eb66affd2da3bf5f8d897376f04aae6af) 2024-11-01T16:50:55.2991833Z - OpenMP 201511 (a.k.a. 
OpenMP 4.5) 2024-11-01T16:50:55.2992793Z - LAPACK is enabled (usually provided by MKL) 2024-11-01T16:50:55.2993682Z - NNPACK is enabled 2024-11-01T16:50:55.2994410Z - CPU capability usage: AVX2 2024-11-01T16:50:55.2996384Z - HIP Runtime 6.2.41133 2024-11-01T16:50:55.2997193Z - MIOpen 3.2.0 2024-11-01T16:50:55.2997826Z - Magma 2.7.2 2024-11-01T16:50:55.3009636Z - Build settings: BLAS_INFO=mkl, BUILD_TYPE=Release, CXX_COMPILER=/opt/cache/bin/c++, CXX_FLAGS= -D_GLIBCXX_USE_CXX11_ABI=1 -fvisibility-inlines-hidden -DUSE_PTHREADPOOL -DNDEBUG -DUSE_KINETO -DLIBKINETO_NOCUPTI -DLIBKINETO_NOXPUPTI=ON -DUSE_FBGEMM -DUSE_PYTORCH_QNNPACK -DUSE_XNNPACK -DSYMBOLICATE_MOBILE_DEBUG_HANDLE -O2 -fPIC -Wall -Wextra -Werror=return-type -Werror=non-virtual-dtor -Werror=bool-operation -Wnarrowing -Wno-missing-field-initializers -Wno-type-limits -Wno-array-bounds -Wno-unknown-pragmas -Wno-unused-parameter -Wno-strict-overflow -Wno-strict-aliasing -Wno-stringop-overflow -Wsuggest-override -Wno-psabi -Wno-error=old-style-cast -Wno-missing-braces -fdiagnostics-color=always -faligned-new -Wno-unused-but-set-variable -Wno-maybe-uninitialized -fno-math-errno -fno-trapping-math -Werror=format -Wno-stringop-overflow, LAPACK_INFO=mkl, PERF_WITH_AVX=1, PERF_WITH_AVX2=1, TORCH_VERSION=2.6.0, USE_CUDA=OFF, USE_CUDNN=OFF, USE_CUSPARSELT=OFF, USE_EXCEPTION_PTR=1, USE_GFLAGS=OFF, USE_GLOG=OFF, USE_GLOO=ON, USE_MKL=ON, USE_MKLDNN=ON, USE_MPI=OFF, USE_NCCL=ON, USE_NNPACK=ON, USE_OPENMP=ON, USE_ROCM=ON, USE_ROCM_KERNEL_ASSERT=OFF, 2024-11-01T16:50:55.3020674Z 2024-11-01T16:50:55.9189927Z + cd test 2024-11-01T16:50:55.9191270Z + python -c 'import torch; print(torch.__config__.parallel_info())' 2024-11-01T16:50:57.1740040Z ATen/Parallel: 2024-11-01T16:50:57.1740882Z at::get_num_threads() : 64 2024-11-01T16:50:57.1741649Z at::get_num_interop_threads() : 64 2024-11-01T16:50:57.1742486Z OpenMP 201511 (a.k.a. 
OpenMP 4.5) 2024-11-01T16:50:57.1743310Z omp_get_max_threads() : 64 2024-11-01T16:50:57.1745093Z Intel(R) oneAPI Math Kernel Library Version 2021.4-Product Build 20210904 for Intel(R) 64 architecture applications 2024-11-01T16:50:57.1746574Z mkl_get_max_threads() : 64 2024-11-01T16:50:57.1747775Z Intel(R) MKL-DNN v3.5.3 (Git Hash 66f0cb9eb66affd2da3bf5f8d897376f04aae6af) 2024-11-01T16:50:57.1748968Z std::thread::hardware_concurrency() : 64 2024-11-01T16:50:57.1749787Z Environment variables: 2024-11-01T16:50:57.1750444Z OMP_NUM_THREADS : [not set] 2024-11-01T16:50:57.1751136Z MKL_NUM_THREADS : [not set] 2024-11-01T16:50:57.1751831Z ATen parallel backend: OpenMP 2024-11-01T16:50:57.1752308Z 2024-11-01T16:50:58.7704919Z + [[ linux-focal-rocm6.2-py3.10 == *aarch64* ]] 2024-11-01T16:50:58.7705937Z + [[ default == *backward* ]] 2024-11-01T16:50:58.7706329Z + [[ default == *xla* ]] 2024-11-01T16:50:58.7706792Z + [[ default == *executorch* ]] 2024-11-01T16:50:58.7707276Z + [[ default == \j\i\t\_\l\e\g\a\c\y ]] 2024-11-01T16:50:58.7707895Z + [[ linux-focal-rocm6.2-py3.10 == *libtorch* ]] 2024-11-01T16:50:58.7708444Z + [[ default == distributed ]] 2024-11-01T16:50:58.7708894Z + [[ default == *inductor_distributed* ]] 2024-11-01T16:50:58.7709367Z + [[ default == *inductor-halide* ]] 2024-11-01T16:50:58.7709844Z + [[ default == *inductor-triton-cpu* ]] 2024-11-01T16:50:58.7710356Z + [[ default == *inductor-micro-benchmark* ]] 2024-11-01T16:50:58.7710815Z + [[ default == *huggingface* ]] 2024-11-01T16:50:58.7711203Z + [[ default == *timm* ]] 2024-11-01T16:50:58.7711562Z + [[ default == *torchbench* ]] 2024-11-01T16:50:58.7711969Z + [[ default == *inductor_cpp_wrapper* ]] 2024-11-01T16:50:58.7712404Z + [[ default == *inductor* ]] 2024-11-01T16:50:58.7712792Z + [[ default == *dynamo_wrapped* ]] 2024-11-01T16:50:58.7713270Z + [[ linux-focal-rocm6.2-py3.10 == *rocm* ]] 2024-11-01T16:50:58.7714472Z + [[ -n test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs 
test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl ]] 2024-11-01T16:50:58.7715489Z + install_torchvision 2024-11-01T16:50:58.7715830Z + local orig_preload 2024-11-01T16:50:58.7716157Z + local commit 2024-11-01T16:50:58.7716488Z ++ get_pinned_commit vision 2024-11-01T16:50:58.7716880Z ++ cat .github/ci_commit_pins/vision.txt 2024-11-01T16:50:58.7750174Z + commit=d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:50:58.7751125Z + orig_preload= 2024-11-01T16:50:58.7751532Z + '[' -n '' ']' 2024-11-01T16:50:58.7752355Z + pip_install --no-use-pep517 --user git+https://github.com/pytorch/vision.git@d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:50:58.7753699Z + pip install --progress-bar off --no-use-pep517 --user git+https://github.com/pytorch/vision.git@d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:50:59.0596903Z Collecting git+https://github.com/pytorch/vision.git@d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:50:59.0601426Z Cloning https://github.com/pytorch/vision.git (to revision d23a6e1664d20707c11781299611436e1f0c104f) to /tmp/pip-req-build-4_j6cy0s 2024-11-01T16:50:59.0642830Z Running command git clone --filter=blob:none --quiet https://github.com/pytorch/vision.git /tmp/pip-req-build-4_j6cy0s 2024-11-01T16:51:05.0527922Z Running command git rev-parse -q --verify 'sha^d23a6e1664d20707c11781299611436e1f0c104f' 2024-11-01T16:51:05.0567268Z Running command git fetch -q https://github.com/pytorch/vision.git d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:51:06.3713503Z Running command git checkout -q d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:51:07.0787776Z Resolved https://github.com/pytorch/vision.git to commit d23a6e1664d20707c11781299611436e1f0c104f 2024-11-01T16:51:10.8888476Z Preparing metadata (setup.py) ... 
done 2024-11-01T16:51:10.8934970Z Requirement already satisfied: numpy in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torchvision==0.19.0a0+d23a6e1) (1.22.4) 2024-11-01T16:51:10.8940577Z Requirement already satisfied: torch in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torchvision==0.19.0a0+d23a6e1) (2.6.0a0+gitd1aa4ef) 2024-11-01T16:51:10.8950759Z Requirement already satisfied: pillow!=8.3.*,>=5.3.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torchvision==0.19.0a0+d23a6e1) (10.3.0) 2024-11-01T16:51:10.9070211Z Requirement already satisfied: filelock in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (3.16.1) 2024-11-01T16:51:10.9077819Z Requirement already satisfied: typing-extensions>=4.10.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (4.12.2) 2024-11-01T16:51:10.9082483Z Requirement already satisfied: networkx in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (2.8.8) 2024-11-01T16:51:10.9097273Z Requirement already satisfied: jinja2 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (3.1.4) 2024-11-01T16:51:10.9100410Z Requirement already satisfied: fsspec in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (2024.10.0) 2024-11-01T16:51:10.9103410Z Requirement already satisfied: sympy==1.13.1 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from torch->torchvision==0.19.0a0+d23a6e1) (1.13.1) 2024-11-01T16:51:10.9126656Z Requirement already satisfied: mpmath<1.4,>=1.1.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from sympy==1.13.1->torch->torchvision==0.19.0a0+d23a6e1) (1.3.0) 2024-11-01T16:51:10.9632849Z Requirement already satisfied: MarkupSafe>=2.0 in /opt/conda/envs/py_3.10/lib/python3.10/site-packages (from
jinja2->torch->torchvision==0.19.0a0+d23a6e1) (3.0.2) 2024-11-01T16:51:10.9684822Z Building wheels for collected packages: torchvision 2024-11-01T16:51:54.1932916Z Building wheel for torchvision (setup.py) ... done 2024-11-01T16:51:54.1968808Z Created wheel for torchvision: filename=torchvision-0.19.0a0+d23a6e1-cp310-cp310-linux_x86_64.whl size=1459213 sha256=8a79f6ff42e88c9d5a1c4c1c0e42c0256577d972bf75aac595918c7fea66cb26 2024-11-01T16:51:54.1973922Z Stored in directory: /var/lib/jenkins/.cache/pip/wheels/0e/56/35/02931e71eb23fd2b85591c7ec05b733ca7c8b328a2fd151f96 2024-11-01T16:51:54.2004150Z Successfully built torchvision 2024-11-01T16:51:54.4006315Z Installing collected packages: torchvision 2024-11-01T16:51:54.7296387Z Successfully installed torchvision-0.19.0a0+d23a6e1 2024-11-01T16:51:54.8360432Z + '[' -n '' ']' 2024-11-01T16:51:54.8363994Z + test_python_shard 1 2024-11-01T16:51:54.8364912Z + [[ -z 2 ]] 2024-11-01T16:51:54.8368238Z + python test/run_test.py --exclude-jit-executor --exclude-distributed-tests --include test_nn test_torch test_cuda test_ops test_unary_ufuncs test_binary_ufuncs test_autograd inductor/test_torchinductor distributed/test_c10d_common distributed/test_c10d_nccl --shard 1 2 --verbose --upload-artifacts-while-running 2024-11-01T16:51:54.9216549Z /var/lib/jenkins/pytorch/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API.
See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-11-01T16:51:54.9218905Z import pkg_resources 2024-11-01T16:52:00.0169920Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2024-11-01T16:52:00.4396120Z Ignoring disabled issues: [''] 2024-11-01T16:52:00.4649090Z Found test times from artifacts 2024-11-01T16:52:00.5203866Z Found test times from artifacts 2024-11-01T16:52:00.5215608Z Running all tests 2024-11-01T16:52:00.5221428Z Running parallel tests on 2 processes 2024-11-01T16:52:00.5222426Z Name: tests to run (est. time: 48.26min) 2024-11-01T16:52:00.5223240Z Serial tests (2): 2024-11-01T16:52:00.5223869Z test_torch 1/1 2024-11-01T16:52:00.5224469Z test_nn 1/1 2024-11-01T16:52:00.5225075Z Parallel tests (8): 2024-11-01T16:52:00.5229386Z inductor/test_torchinductor 4/6 2024-11-01T16:52:00.5230418Z test_ops 1/9 2024-11-01T16:52:00.5231127Z test_ops 2/9 2024-11-01T16:52:00.5231808Z test_ops 5/9 2024-11-01T16:52:00.5232452Z test_ops 6/9 2024-11-01T16:52:00.5233105Z test_ops 9/9 2024-11-01T16:52:00.5236913Z test_binary_ufuncs 1/1 2024-11-01T16:52:00.5237779Z test_unary_ufuncs 1/1 2024-11-01T16:52:00.5238527Z Name: excluded (est. time: 0.0min) 2024-11-01T16:52:00.5239303Z Serial tests (0): 2024-11-01T16:52:00.5239937Z Parallel tests (0): 2024-11-01T16:52:00.5287813Z Running test_torch 1/1 ... [2024-11-01 16:52:00.527999] 2024-11-01T16:52:00.5288943Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T16:52:00.5293145Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_torch.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2024-11-01 16:52:00.528287] 2024-11-01T16:56:30.2576684Z 2024-11-01T16:56:30.2583185Z test_torch 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_torch_1.1_07b0b0150c31a96c_.log 2024-11-01T16:56:30.3088740Z Running 1053 items in this shard: test/test_torch.py::TestBasicVitalSigns::test_basic_vitals, test/test_torch.py::TestBasicVitalSigns::test_basic_vitals_read_write, test/test_torch.py::TestBasicVitalSigns::test_dataloader_vitals, test/test_torch.py::TestTorch::test_RNGState, test/test_torch.py::TestTorch::test_RNGStateAliasing, test/test_torch.py::TestTorch::test_RNG_after_pickle, test/test_torch.py::TestTorch::test_Size, test/test_torch.py::TestTorch::test_Size_iter, test/test_torch.py::TestTorch::test_Size_scalar, test/test_torch.py::TestTorch::test_add_meta_scalar, test/test_torch.py::TestTorch::test_allow_tensor_metadata_change, test/test_torch.py::TestTorch::test_apply, test/test_torch.py::TestTorch::test_as_subclass, test/test_torch.py::TestTorch::test_assert_async, test/test_torch.py::TestTorch::test_backward_hooks_traverse, test/test_torch.py::TestTorch::test_batch_norm_cpu_inference, test/test_torch.py::TestTorch::test_bf16_supported_on_cpu, test/test_torch.py::TestTorch::test_bmm_multithreaded, test/test_torch.py::TestTorch::test_boxMullerState, test/test_torch.py::TestTorch::test_cat_neg_dim, test/test_torch.py::TestTorch::test_check, test/test_torch.py::TestTorch::test_chunk_neg_dim, test/test_torch.py::TestTorch::test_conj_neg_tolist, test/test_torch.py::TestTorch::test_contains, test/test_torch.py::TestTorch::test_copy_broadcast, test/test_torch.py::TestTorch::test_copy_dtypes, test/test_torch.py::TestTorch::test_copy_float16, test/test_torch.py::TestTorch::test_copy_many_to_one, test/test_torch.py::TestTorch::test_copy_transpose, test/test_torch.py::TestTorch::test_cuda_not_built, test/test_torch.py::TestTorch::test_cummax_neg_dim, test/test_torch.py::TestTorch::test_cummin_neg_dim, 
test/test_torch.py::TestTorch::test_cumprod_neg_dim, test/test_torch.py::TestTorch::test_cumsum_neg_dim, test/test_torch.py::TestTorch::test_cxx_flags, test/test_torch.py::TestTorch::test_data_ptr_of_empty_tensor_with_storage, test/test_torch.py::TestTorch::test_data_ptr_of_empty_view_with_storage, test/test_torch.py::TestTorch::test_deepcopy_gradient, test/test_torch.py::TestTorch::test_deepcopy_parameter, test/test_torch.py::TestTorch::test_deterministic_fill_uninitialized_memory, test/test_torch.py::TestTorch::test_deterministic_flag, test/test_torch.py::TestTorch::test_device, test/test_torch.py::TestTorch::test_dim_order, test/test_torch.py::TestTorch::test_dir, test/test_torch.py::TestTorch::test_doc, test/test_torch.py::TestTorch::test_doc_template, test/test_torch.py::TestTorch::test_dot_data_use, test/test_torch.py::TestTorch::test_dtype_is_signed, test/test_torch.py::TestTorch::test_element_size, test/test_torch.py::TestTorch::test_empty_meta, test/test_torch.py::TestTorch::test_empty_storage_view, test/test_torch.py::TestTorch::test_equal, test/test_torch.py::TestTorch::test_error_msg_type_translation, test/test_torch.py::TestTorch::test_fill_diagonal, test/test_torch.py::TestTorch::test_format_scalar_meta, test/test_torch.py::TestTorch::test_from_buffer, test/test_torch.py::TestTorch::test_from_file, test/test_torch.py::TestTorch::test_gather_neg_dim, test/test_torch.py::TestTorch::test_generator_cpu, test/test_torch.py::TestTorch::test_get_cpu_capability, test/test_torch.py::TestTorch::test_has_internal_overlap, test/test_torch.py::TestTorch::test_has_storage, test/test_torch.py::TestTorch::test_index_add, test/test_torch.py::TestTorch::test_index_add_all_dtypes, test/test_torch.py::TestTorch::test_index_add_cornercase, test/test_torch.py::TestTorch::test_index_add_correctness, test/test_torch.py::TestTorch::test_index_add_neg_dim, test/test_torch.py::TestTorch::test_index_copy_neg_dim, test/test_torch.py::TestTorch::test_index_fill_neg_dim, 
test/test_torch.py::TestTorch::test_index_select_neg_dim, test/test_torch.py::TestTorch::test_invalid_arg_error_handling, test/test_torch.py::TestTorch::test_invalid_generator_raises, test/test_torch.py::TestTorch::test_is_nonzero, test/test_torch.py::TestTorch::test_is_same_size, test/test_torch.py::TestTorch::test_iter, test/test_torch.py::TestTorch::test_kthvalue_neg_dim, test/test_torch.py::TestTorch::test_linspace_logspace, test/test_torch.py::TestTorch::test_logcumsumexp_neg_dim, test/test_torch.py::TestTorch::test_manual_seed, test/test_torch.py::TestTorch::test_map, test/test_torch.py::TestTorch::test_map2, test/test_torch.py::TestTorch::test_max_neg_dim, test/test_torch.py::TestTorch::test_mean_neg_dim, test/test_torch.py::TestTorch::test_median_neg_dim, test/test_torch.py::TestTorch::test_memory_format, test/test_torch.py::TestTorch::test_memory_format_contiguous_returns_same_tensor_if_already_satisfies, test/test_torch.py::TestTorch::test_memory_format_empty, test/test_torch.py::TestTorch::test_min_neg_dim, test/test_torch.py::TestTorch::test_mode_neg_dim, test/test_torch.py::TestTorch::test_multinomial_invalid_probs, test/test_torch.py::TestTorch::test_nanmedian_neg_dim, test/test_torch.py::TestTorch::test_narrow_neg_dim, test/test_torch.py::TestTorch::test_nbytes, test/test_torch.py::TestTorch::test_ndim, test/test_torch.py::TestTorch::test_new, test/test_torch.py::TestTorch::test_newaxis_numpy_comparison, test/test_torch.py::TestTorch::test_newindex, test/test_torch.py::TestTorch::test_no_cuda_monkeypatch, test/test_torch.py::TestTorch::test_norm_neg_dim, test/test_torch.py::TestTorch::test_normal_shape, test/test_torch.py::TestTorch::test_numel, test/test_torch.py::TestTorch::test_parallel_info, test/test_torch.py::TestTorch::test_parsing_double, test/test_torch.py::TestTorch::test_parsing_int64, test/test_torch.py::TestTorch::test_parsing_intlist, test/test_torch.py::TestTorch::test_permute, test/test_torch.py::TestTorch::test_pickle, 
test/test_torch.py::TestTorch::test_pickle_dtype, test/test_torch.py::TestTorch::test_pickle_function, test/test_torch.py::TestTorch::test_pickle_generator, test/test_torch.py::TestTorch::test_pickle_parameter, test/test_torch.py::TestTorch::test_pickle_parameter_no_requires_grad, test/test_torch.py::TestTorch::test_pickle_size, test/test_torch.py::TestTorch::test_pin_memory, test/test_torch.py::TestTorch::test_print, test/test_torch.py::TestTorch::test_prod_neg_dim, test/test_torch.py::TestTorch::test_pyobj_preserved, test/test_torch.py::TestTorch::test_qengine, test/test_torch.py::TestTorch::test_renorm_neg_dim, test/test_torch.py::TestTorch::test_resizable, test/test_torch.py::TestTorch::test_reversed, test/test_torch.py::TestTorch::test_scatter_neg_dim, test/test_torch.py::TestTorch::test_select_neg_dim, test/test_torch.py::TestTorch::test_set_flush_denormal, test/test_torch.py::TestTorch::test_setting_real_imag_to_a_number, test/test_torch.py::TestTorch::test_show_config, test/test_torch.py::TestTorch::test_size_neg_dim, test/test_torch.py::TestTorch::test_size_stride, test/test_torch.py::TestTorch::test_sizeof, test/test_torch.py::TestTorch::test_slice, test/test_torch.py::TestTorch::test_slow_test, test/test_torch.py::TestTorch::test_sobolengine_bounds, test/test_torch.py::TestTorch::test_sobolengine_bounds_scrambled, test/test_torch.py::TestTorch::test_sobolengine_continuing, test/test_torch.py::TestTorch::test_sobolengine_continuing_scrambled, test/test_torch.py::TestTorch::test_sobolengine_default_dtype, test/test_torch.py::TestTorch::test_sobolengine_distribution, test/test_torch.py::TestTorch::test_sobolengine_distribution_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw, test/test_torch.py::TestTorch::test_sobolengine_draw_base2, test/test_torch.py::TestTorch::test_sobolengine_draw_base2_scrambled, test/test_torch.py::TestTorch::test_sobolengine_draw_scrambled, test/test_torch.py::TestTorch::test_sobolengine_fast_forward, 
test/test_torch.py::TestTorch::test_sobolengine_fast_forward_scrambled, test/test_torch.py::TestTorch::test_sobolengine_first_point, test/test_torch.py::TestTorch::test_sobolengine_high_dim, test/test_torch.py::TestTorch::test_sobolengine_raise, test/test_torch.py::TestTorch::test_sobolengine_reset, test/test_torch.py::TestTorch::test_sobolengine_reset_scrambled, test/test_torch.py::TestTorch::test_sort_neg_dim, test/test_torch.py::TestTorch::test_split_neg_dim, test/test_torch.py::TestTorch::test_split_with_sizes_copy_out, test/test_torch.py::TestTorch::test_squeeze_neg_dim, test/test_torch.py::TestTorch::test_std_neg_dim, test/test_torch.py::TestTorch::test_storage_base_init, test/test_torch.py::TestTorch::test_storage_base_new, test/test_torch.py::TestTorch::test_storage_byteswap, test/test_torch.py::TestTorch::test_storage_casts, test/test_torch.py::TestTorch::test_storage_cycle_via_dict, test/test_torch.py::TestTorch::test_storage_cycle_via_slots, test/test_torch.py::TestTorch::test_storage_dead_weak_ref, test/test_torch.py::TestTorch::test_storage_dealloc, test/test_torch.py::TestTorch::test_storage_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_resurrected, test/test_torch.py::TestTorch::test_storage_dealloc_subclass_zombie, test/test_torch.py::TestTorch::test_storage_dict_dealloc, test/test_torch.py::TestTorch::test_storage_error, test/test_torch.py::TestTorch::test_storage_error_no_attribute, test/test_torch.py::TestTorch::test_storage_finalizer_dealloc, test/test_torch.py::TestTorch::test_storage_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_resurrected, test/test_torch.py::TestTorch::test_storage_from_tensor_dealloc_zombie, test/test_torch.py::TestTorch::test_storage_preserve_nonhermetic_in_hermetic_context, test/test_torch.py::TestTorch::test_storage_resurrected_weak_ref, 
test/test_torch.py::TestTorch::test_storage_slot_dealloc, test/test_torch.py::TestTorch::test_storage_weakref_dealloc, test/test_torch.py::TestTorch::test_structseq_repr, test/test_torch.py::TestTorch::test_subclass_preserved, test/test_torch.py::TestTorch::test_subclass_tensors, test/test_torch.py::TestTorch::test_sum_neg_dim, test/test_torch.py::TestTorch::test_swap_basic, test/test_torch.py::TestTorch::test_swap_fail_slots, test/test_torch.py::TestTorch::test_t_not_2d_error, test/test_torch.py::TestTorch::test_tensor_base_init, test/test_torch.py::TestTorch::test_tensor_base_new, test/test_torch.py::TestTorch::test_tensor_ctor_scalar, test/test_torch.py::TestTorch::test_tensor_cycle_via_dict, test/test_torch.py::TestTorch::test_tensor_cycle_via_slots, test/test_torch.py::TestTorch::test_tensor_dead_weak_ref, test/test_torch.py::TestTorch::test_tensor_dict_dealloc, test/test_torch.py::TestTorch::test_tensor_finalizer_dealloc, test/test_torch.py::TestTorch::test_tensor_fix_weakref_no_leak, test/test_torch.py::TestTorch::test_tensor_ressurecting_clear, test/test_torch.py::TestTorch::test_tensor_resurrected_weak_ref, test/test_torch.py::TestTorch::test_tensor_set, test/test_torch.py::TestTorch::test_tensor_set_errors, test/test_torch.py::TestTorch::test_tensor_slot_dealloc, test/test_torch.py::TestTorch::test_tensor_weakref_dealloc, test/test_torch.py::TestTorch::test_tensor_where_scalar, test/test_torch.py::TestTorch::test_tensoriterator_output_setup, test/test_torch.py::TestTorch::test_terminate_handler_on_crash, test/test_torch.py::TestTorch::test_to, test/test_torch.py::TestTorch::test_to_with_tensor, test/test_torch.py::TestTorch::test_topk_neg_dim, test/test_torch.py::TestTorch::test_torch_from_file, test/test_torch.py::TestTorch::test_transpose_neg_dim, test/test_torch.py::TestTorch::test_type, test/test_torch.py::TestTorch::test_type_alias, test/test_torch.py::TestTorch::test_type_conversion_via_dtype_name, 
test/test_torch.py::TestTorch::test_typed_storage_deprecation_warning, test/test_torch.py::TestTorch::test_typed_storage_internal_no_warning, test/test_torch.py::TestTorch::test_unbind_neg_dim, test/test_torch.py::TestTorch::test_unflatten, test/test_torch.py::TestTorch::test_unfold_neg_dim, test/test_torch.py::TestTorch::test_unsqueeze_neg_dim, test/test_torch.py::TestTorch::test_upsample_nearest1d_meta, test/test_torch.py::TestTorch::test_upsample_nearest2d_meta, test/test_torch.py::TestTorch::test_var_neg_dim, test/test_torch.py::TestTorch::test_warn_types, test/test_torch.py::TestTorch::test_wildcard_import, test/test_torch.py::TestVitalSignsCudaCUDA::test_cuda_vitals_gpu_only_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcdiv_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_addcmul_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_assertRaisesRegex_ignore_msg_non_native_device_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_edge_cases_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_p_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bernoulli_self_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bfloat16_neg_abs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bool_tensor_value_change_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_add_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcdiv_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_addcmul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_atan2_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_copy_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_div_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_eq_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_fmod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ge_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_gt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_le_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lerp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_lt_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map2_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_map_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_fill_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_scatter_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_masked_select_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_min_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_mul_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_ne_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_pow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_remainder_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_broadcast_fn_sub_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_bytes_to_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cauchy_no_inf_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_cuda_backward_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_euclidean_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_grad_p_lt_1_no_nan_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_batch_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_norm_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_cdist_same_inputs_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_all_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_check_tensor_internal_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_not_memory_dense_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_clone_zero_stride_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_complex_half_experimental_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_constants_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_backward_agnostic_to_memory_format_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_conv_transposed_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy__cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_all_dtypes_and_devices_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_math_view_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_mem_overlap_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_copy_transpose_math_view_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_corrcoef_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cov_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cpp_warnings_have_python_context_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cublas_config_nondeterministic_alert_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_cummin_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummax_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cummin_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumprod_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deepcopy_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_empty_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_replication_pad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_deterministic_resize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_device_guard_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_diff_noncontig_cuda_uint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_dim_function_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_discontiguous_out_cumsum_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dist_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_dtypetensor_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_errors_index_copy_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_expected_failure_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_exponential_no_zero_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_deterministic_path_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gather_backward_one_dim_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_geometric_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scale_will_not_overflow_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_deprecated_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaler_pass_itself_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_accumulation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach0_fused0_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach2_fused_True_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_autocast_foreach_True_fused1_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_clipping_separate_unscale_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_multiple_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_penalty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_state_dict_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_unscale_sparse_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_grad_scaling_update_scale_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_all_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_extreme_cases_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_spacing_list_length_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_gradient_type_promotion_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_hook_remove_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_add_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_copy_scalars_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_put_non_accumulate_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amax_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_amin_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_mean_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_reduce_reduce_prod_cuda_uint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float8_e4m3fn, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float8_e4m3fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float8_e5m2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_float8_e5m2fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_index_select_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_invalid_shapes_grid_sampler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_set_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_is_signed_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e4m3fn, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e4m3fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e5m2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_float8_e5m2fnuz, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_item_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumprod_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_large_cumsum_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_binary_op_no_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lazy_clone_view_materialize_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_log_normal_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_logcumsumexp_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_lognormal_kstest_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_bool_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bfloat16_uint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_bool_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex128_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_complex64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_float64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int16_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int32_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int64_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_int8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_cuda_uint8_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_fill_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_bool_tensor_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_inplace_noncontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_large_tensor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_int8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_masked_select_discontiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_clone_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_consistency_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_cpu_and_cuda_ops_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_empty_like_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_factory_like_functions_preserve_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_operators_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_preserved_after_permute_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_propagation_rules_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_to_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_memory_format_type_shortcuts_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_module_share_memory_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cpu_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_deterministic_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_device_constrain_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_w_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_empty_wo_replacement_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_gpu_device_constrain_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_multinomial_rng_state_advance_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_copy_non_contiguous_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_narrow_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveAvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AdaptiveMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_AvgPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_CTCLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_EmbeddingBag_max_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_FractionalMaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxPool3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool1d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool2d_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_MaxUnpool3d_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_NLLLoss_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReflectionPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad1d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_ReplicationPad3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_bincount_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_cumsum_cuda_uint8, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_2d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_grid_sample_3d_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_histc_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bicubic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_bilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_linear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_interpolate_trilinear_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_kthvalue_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_median_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_accumulate_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_alert_put_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint2x4, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nondeterministic_resize_quantized_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_normal_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_nullary_op_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pairwise_distance_empty_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_parallel_cow_materialize_error_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_AdamW_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_Adam_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_params_invalidated_with_grads_invalidated_between_unscale_and_step_SGD_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pdist_norm_large_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pickle_gradscaler_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_pin_memory_from_constructor_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_bfloat16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_accumulate_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_put_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_reduced_type_float_copy_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_repeat_interleave_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scalar_check_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_non_unique_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_one_dim_deterministic_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_add_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_bool_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_multiply_unsupported_dtypes_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_non_unique_index_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bfloat16, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_operations_to_large_input_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_reduce_scalar_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_to_large_input_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_scatter_zero_size_index_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_serialization_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_default_tensor_type_warnings_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_set_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_shift_mem_overlap_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_skip_xla_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_non_blocking_False_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_all_devices_non_blocking_True_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex128, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_errors_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_bool, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_from_tensor_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_meta_ok_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex128, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_qint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint4x2, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_quint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_storage_setitem_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_strides_propagation_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_sync_warning_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_take_empty_cuda, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_from_storage_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_set_errors_multigpu_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_shape_empty_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_float64, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_storage_type_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_tensor_type_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_ternary_op_mem_overlap_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_bool, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex128, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_complex64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int32, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_int8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_typed_storage_meta_cuda_uint8, test/test_torch.py::TestTorchDeviceTypeCUDA::test_unfold_all_devices_and_dtypes_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_unfold_scalars_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_bfloat16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float16, test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float32, 
test/test_torch.py::TestTorchDeviceTypeCUDA::test_uniform_kstest_cuda_float64, test/test_torch.py::TestTorchDeviceTypeCUDA::test_untyped_storage_meta_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_warn_always_caught_cuda, test/test_torch.py::TestTorchDeviceTypeCUDA::test_where_scalar_handcrafted_values_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_cpu_devices_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_advancedindex_mixed_devices_error_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_clamp_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_broadcast_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_copy_noncontig_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_cuda_device_idx_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_device_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_float64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int16, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int32, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int64, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_int8, test/test_torch.py::TestDevicePrecisionCUDA::test_from_sequence_cuda_uint8, test/test_torch.py::TestDevicePrecisionCUDA::test_index_add_bfloat16_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_multidevice_serialization_cuda, test/test_torch.py::TestDevicePrecisionCUDA::test_type_conversions_same_device_cuda 2024-11-01T16:56:30.3375240Z 2024-11-01T16:56:30.3375548Z Running test_nn 1/1 ... 
[2024-11-01 16:56:30.260364] 2024-11-01T16:56:30.3375964Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T16:56:30.3377161Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_nn.py', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 16:56:30.260943] 2024-11-01T17:02:37.2684831Z 2024-11-01T17:02:37.2691091Z test_nn 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_nn_1.1_275490cb9e8e1934_.log 2024-11-01T17:02:37.4109558Z Running 2201 items in this shard: test/test_nn.py::TestNN::test_AdaptiveLogSoftmax, test/test_nn.py::TestNN::test_AdaptiveLogSoftmax_cuda, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_BCELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCELoss_no_reduce, test/test_nn.py::TestNN::test_BCELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar, test/test_nn.py::TestNN::test_BCELoss_weights_no_reduce_scalar_cuda, 
test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_legacy_enum_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_BCEWithLogitsLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_CELU_no_batch_dim, test/test_nn.py::TestNN::test_CELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_CTCLoss_critical_target_len, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cpu, test/test_nn.py::TestNN::test_CTCLoss_lengthchecks_cuda, test/test_nn.py::TestNN::test_CTCLoss_long_targets, test/test_nn.py::TestNN::test_CTCLoss_typechecks, test/test_nn.py::TestNN::test_CTCLoss_zero_infinity, test/test_nn.py::TestNN::test_CTCLoss_zero_lengths, test/test_nn.py::TestNN::test_Conv1d, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_cuda, 
test/test_nn.py::TestNN::test_Conv1d_dilated, test/test_nn.py::TestNN::test_Conv1d_dilated_cuda, test/test_nn.py::TestNN::test_Conv1d_groups, test/test_nn.py::TestNN::test_Conv1d_groups_cuda, test/test_nn.py::TestNN::test_Conv1d_pad1, test/test_nn.py::TestNN::test_Conv1d_pad1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad1size1, test/test_nn.py::TestNN::test_Conv1d_pad1size1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad2, test/test_nn.py::TestNN::test_Conv1d_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_pad2size1, test/test_nn.py::TestNN::test_Conv1d_pad2size1_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same, test/test_nn.py::TestNN::test_Conv1d_pad_same2, test/test_nn.py::TestNN::test_Conv1d_pad_same2_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv1d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv1d_pad_valid, test/test_nn.py::TestNN::test_Conv1d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_reflect_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv1d_stride, test/test_nn.py::TestNN::test_Conv1d_stride_cuda, test/test_nn.py::TestNN::test_Conv1d_zero_batch, test/test_nn.py::TestNN::test_Conv1d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv1d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise, test/test_nn.py::TestNN::test_Conv2d_depthwise_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated, test/test_nn.py::TestNN::test_Conv2d_depthwise_dilated_cuda, 
test/test_nn.py::TestNN::test_Conv2d_depthwise_padded, test/test_nn.py::TestNN::test_Conv2d_depthwise_padded_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided, test/test_nn.py::TestNN::test_Conv2d_depthwise_strided_cuda, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier, test/test_nn.py::TestNN::test_Conv2d_depthwise_with_multiplier_cuda, test/test_nn.py::TestNN::test_Conv2d_dilated, test/test_nn.py::TestNN::test_Conv2d_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_dilated_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_groups, test/test_nn.py::TestNN::test_Conv2d_groups_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_thnn, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_thnn_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_no_bias, test/test_nn.py::TestNN::test_Conv2d_no_bias_cuda, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_same, test/test_nn.py::TestNN::test_Conv2d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv2d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv2d_pad_valid, test/test_nn.py::TestNN::test_Conv2d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv2d_padding, test/test_nn.py::TestNN::test_Conv2d_padding_cuda, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_padding_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_reflect_stride2_pad2_cuda, 
test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv2d_strided, test/test_nn.py::TestNN::test_Conv2d_strided_cuda, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_strided_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_zero_batch, test/test_nn.py::TestNN::test_Conv2d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv2d_zero_batch_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv2d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_cuda, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_1x1x1_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_circular_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d_cuda, test/test_nn.py::TestNN::test_Conv3d_dilated, test/test_nn.py::TestNN::test_Conv3d_dilated_cuda, test/test_nn.py::TestNN::test_Conv3d_dilated_strided, test/test_nn.py::TestNN::test_Conv3d_dilated_strided_cuda, test/test_nn.py::TestNN::test_Conv3d_groups, test/test_nn.py::TestNN::test_Conv3d_groups_cuda, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_no_bias, test/test_nn.py::TestNN::test_Conv3d_no_bias_cuda, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_same, 
test/test_nn.py::TestNN::test_Conv3d_pad_same_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated, test/test_nn.py::TestNN::test_Conv3d_pad_same_dilated_cuda, test/test_nn.py::TestNN::test_Conv3d_pad_valid, test/test_nn.py::TestNN::test_Conv3d_pad_valid_cuda, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_replicate_stride2_pad2_cuda, test/test_nn.py::TestNN::test_Conv3d_stride, test/test_nn.py::TestNN::test_Conv3d_stride_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_padding, test/test_nn.py::TestNN::test_Conv3d_stride_padding_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_padding_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_stride_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_zero_batch, test/test_nn.py::TestNN::test_Conv3d_zero_batch_cuda, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor, test/test_nn.py::TestNN::test_Conv3d_zero_batch_with_long_tensor_cuda, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2, test/test_nn.py::TestNN::test_Conv3d_zeros_stride2_pad2_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d, test/test_nn.py::TestNN::test_ConvTranspose1d_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated, test/test_nn.py::TestNN::test_ConvTranspose1d_dilated_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_groups, test/test_nn.py::TestNN::test_ConvTranspose1d_groups_cuda, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose1d_no_bias_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d, test/test_nn.py::TestNN::test_ConvTranspose2d_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated, 
test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_dilated_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_groups, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_groups_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_no_bias_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor, test/test_nn.py::TestNN::test_ConvTranspose2d_with_long_tensor_cuda, test/test_nn.py::TestNN::test_ConvTranspose3d, test/test_nn.py::TestNN::test_ConvTranspose3d_cuda, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated, test/test_nn.py::TestNN::test_ConvTranspose3d_dilated_cuda, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_float, 
test/test_nn.py::TestNN::test_CosineEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_CrossMapLRN2d, test/test_nn.py::TestNN::test_CrossMapLRN2d_cuda, test/test_nn.py::TestNN::test_ELU_no_batch_dim, test/test_nn.py::TestNN::test_ELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Embedding, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous, test/test_nn.py::TestNN::test_EmbeddingBag_discontiguous_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max, test/test_nn.py::TestNN::test_EmbeddingBag_max_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_max_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean, test/test_nn.py::TestNN::test_EmbeddingBag_mean_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_mean_padding_idx_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sparse, test/test_nn.py::TestNN::test_EmbeddingBag_sparse_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum, test/test_nn.py::TestNN::test_EmbeddingBag_sum_cuda, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx, test/test_nn.py::TestNN::test_EmbeddingBag_sum_padding_idx_cuda, test/test_nn.py::TestNN::test_Embedding_cuda, test/test_nn.py::TestNN::test_Embedding_discontiguous, test/test_nn.py::TestNN::test_Embedding_discontiguous_cuda, test/test_nn.py::TestNN::test_Embedding_sparse, test/test_nn.py::TestNN::test_Embedding_sparse_cuda, test/test_nn.py::TestNN::test_Flatten, test/test_nn.py::TestNN::test_Flatten_cuda, test/test_nn.py::TestNN::test_Flatten_no_batch_dim, test/test_nn.py::TestNN::test_Flatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Fold, test/test_nn.py::TestNN::test_Fold_cuda, test/test_nn.py::TestNN::test_Fold_int_input, test/test_nn.py::TestNN::test_Fold_int_input_cuda, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_input_cuda, 
test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input, test/test_nn.py::TestNN::test_Fold_no_batch_dim_int_input_cuda, test/test_nn.py::TestNN::test_GELU_no_batch_dim, test/test_nn.py::TestNN::test_GELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_GLU_no_batch_dim, test/test_nn.py::TestNN::test_GLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim, test/test_nn.py::TestNN::test_Hardshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim, test/test_nn.py::TestNN::test_Hardsigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim, test/test_nn.py::TestNN::test_Hardswish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim, test/test_nn.py::TestNN::test_Hardtanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce, test/test_nn.py::TestNN::test_HingeEmbeddingLoss_no_reduce_cuda, 
test/test_nn.py::TestNN::test_HuberLoss_delta, test/test_nn.py::TestNN::test_HuberLoss_delta_cuda, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_HuberLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean, test/test_nn.py::TestNN::test_KLDivLoss_batch_mean_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_KLDivLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target, 
test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target, test/test_nn.py::TestNN::test_KLDivLoss_no_reduce_scalar_log_target_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_log_target_no_reduce_cuda, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce, test/test_nn.py::TestNN::test_KLDivLoss_with_target_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_L1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_L1Loss_no_reduce, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex, test/test_nn.py::TestNN::test_L1Loss_no_reduce_complex_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_L1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_LSTM_cell, test/test_nn.py::TestNN::test_LSTM_cell_forward_hidden_size, test/test_nn.py::TestNN::test_LSTM_cell_forward_input_size, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature, 
test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_cuda, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval, test/test_nn.py::TestNN::test_LayerNorm_3d_no_affine_large_feature_eval_cuda, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim, test/test_nn.py::TestNN::test_LeakyReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear, test/test_nn.py::TestNN::test_Linear_cuda, test/test_nn.py::TestNN::test_Linear_no_batch_dim, test/test_nn.py::TestNN::test_Linear_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Linear_no_bias, test/test_nn.py::TestNN::test_Linear_no_bias_cuda, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim, test/test_nn.py::TestNN::test_LogSigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MSELoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MSELoss_no_reduce, test/test_nn.py::TestNN::test_MSELoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar, test/test_nn.py::TestNN::test_MSELoss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_float, 
test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MarginRankingLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MaxUnpool1d_net, test/test_nn.py::TestNN::test_MaxUnpool1d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool1d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net, test/test_nn.py::TestNN::test_MaxUnpool2d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool2d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net, test/test_nn.py::TestNN::test_MaxUnpool3d_net_cuda, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim, test/test_nn.py::TestNN::test_MaxUnpool3d_net_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Mish_no_batch_dim, test/test_nn.py::TestNN::test_Mish_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ModuleDict, test/test_nn.py::TestNN::test_ModuleList, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_0d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_index_neg_cuda, 
test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_float, 
test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiLabelSoftMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_1d_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_margin_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_p_no_reduce_cuda, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce, test/test_nn.py::TestNN::test_MultiMarginLoss_weights_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss2d_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLossNd_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_mean_cuda_half, 
test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_NLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_NLLLoss_no_reduce, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_cuda, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg, test/test_nn.py::TestNN::test_NLLLoss_no_reduce_weights_ignore_index_neg_cuda, test/test_nn.py::TestNN::test_PReLU_backward_requires_grad_false, test/test_nn.py::TestNN::test_PReLU_no_batch_dim, test/test_nn.py::TestNN::test_PReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_lhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs, test/test_nn.py::TestNN::test_PairwiseDistance_broadcast_rhs_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim, test/test_nn.py::TestNN::test_PairwiseDistance_no_batch_dim_cuda, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args, test/test_nn.py::TestNN::test_PairwiseDistance_with_non_default_args_cuda, 
test/test_nn.py::TestNN::test_ParameterDict, test/test_nn.py::TestNN::test_ParameterDict_replication, test/test_nn.py::TestNN::test_ParameterList, test/test_nn.py::TestNN::test_ParameterList_meta, test/test_nn.py::TestNN::test_ParameterList_replication, test/test_nn.py::TestNN::test_PixelShuffle, test/test_nn.py::TestNN::test_PixelShuffle_cuda, test/test_nn.py::TestNN::test_PixelUnshuffle, test/test_nn.py::TestNN::test_PixelUnshuffle_cuda, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce, test/test_nn.py::TestNN::test_PoissonNLLLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_RNN_cell, test/test_nn.py::TestNN::test_RNN_cell_forward_zero_hidden_size, test/test_nn.py::TestNN::test_RNN_cell_no_broadcasting, test/test_nn.py::TestNN::test_RNN_change_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_no_dropout, test/test_nn.py::TestNN::test_RNN_cpu_vs_cudnn_with_dropout, test/test_nn.py::TestNN::test_RNN_cudnn_weight_norm, test/test_nn.py::TestNN::test_RNN_dropout, test/test_nn.py::TestNN::test_RNN_dropout_state, test/test_nn.py::TestNN::test_RNN_input_size_zero, test/test_nn.py::TestNN::test_RNN_nonlinearity, 
test/test_nn.py::TestNN::test_RNN_nonlinearity_passed_as_arg, test/test_nn.py::TestNN::test_RReLU, test/test_nn.py::TestNN::test_RReLU_cuda, test/test_nn.py::TestNN::test_RReLU_no_batch_dim, test/test_nn.py::TestNN::test_RReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down, test/test_nn.py::TestNN::test_RReLU_with_up_down_cuda, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar, test/test_nn.py::TestNN::test_RReLU_with_up_down_scalar_cuda, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim, test/test_nn.py::TestNN::test_ReLU6_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReLU_no_batch_dim, test/test_nn.py::TestNN::test_ReLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d, test/test_nn.py::TestNN::test_ReplicationPad3d_complex, test/test_nn.py::TestNN::test_ReplicationPad3d_complex_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_cuda, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim, test/test_nn.py::TestNN::test_ReplicationPad3d_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SELU_no_batch_dim, test/test_nn.py::TestNN::test_SELU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sequential_add, test/test_nn.py::TestNN::test_Sequential_append, test/test_nn.py::TestNN::test_Sequential_delitem, test/test_nn.py::TestNN::test_Sequential_extend, test/test_nn.py::TestNN::test_Sequential_getitem, test/test_nn.py::TestNN::test_Sequential_iadd, test/test_nn.py::TestNN::test_Sequential_imul, test/test_nn.py::TestNN::test_Sequential_insert, test/test_nn.py::TestNN::test_Sequential_insert_fail_case, test/test_nn.py::TestNN::test_Sequential_mul, test/test_nn.py::TestNN::test_Sequential_pop, test/test_nn.py::TestNN::test_Sequential_rmul, test/test_nn.py::TestNN::test_Sequential_setitem, test/test_nn.py::TestNN::test_Sequential_setitem_named, test/test_nn.py::TestNN::test_SiLU_no_batch_dim, test/test_nn.py::TestNN::test_SiLU_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim, 
test/test_nn.py::TestNN::test_Sigmoid_no_batch_dim_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_beta_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_SmoothL1Loss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar, test/test_nn.py::TestNN::test_SmoothL1Loss_no_reduce_scalar_cuda, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta, test/test_nn.py::TestNN::test_SmoothL1Loss_zero_beta_cuda, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum, 
test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_SoftMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce, test/test_nn.py::TestNN::test_SoftMarginLoss_no_reduce_cuda, test/test_nn.py::TestNN::test_Softplus_no_batch_dim, test/test_nn.py::TestNN::test_Softplus_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim, test/test_nn.py::TestNN::test_Softshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Softsign_no_batch_dim, test/test_nn.py::TestNN::test_Softsign_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanh_no_batch_dim, test/test_nn.py::TestNN::test_Tanh_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim, test/test_nn.py::TestNN::test_Tanhshrink_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Threshold_no_batch_dim, test/test_nn.py::TestNN::test_Threshold_no_batch_dim_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_gelu_activation_cuda, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerDecoderLayer_relu_activation_cuda, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_gelu_activation_cuda, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation, test/test_nn.py::TestNN::test_TransformerEncoderLayer_relu_activation_cuda, test/test_nn.py::TestNN::test_Transformer_cell, test/test_nn.py::TestNN::test_Transformer_multilayer_coder, test/test_nn.py::TestNN::test_Transformer_multilayer_coder_cuda, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_float, 
test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_mean_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_none_cuda_half, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_double, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_float, test/test_nn.py::TestNN::test_TripletMarginLoss_no_batch_dim_sum_cuda_half, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim, test/test_nn.py::TestNN::test_Unflatten_no_batch_dim_cuda, test/test_nn.py::TestNN::test_Unfold, test/test_nn.py::TestNN::test_Unfold_cuda, test/test_nn.py::TestNN::test_Unfold_int_input, test/test_nn.py::TestNN::test_Unfold_int_input_cuda, test/test_nn.py::TestNN::test_adaptive_log_softmax, test/test_nn.py::TestNN::test_add_module, test/test_nn.py::TestNN::test_add_module_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_affine_grid, test/test_nn.py::TestNN::test_affine_grid_3d, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_2, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cpu_nd_3, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_2, test/test_nn.py::TestNN::test_affine_grid_backward_cl_cf_consistency_device_cuda_nd_3, test/test_nn.py::TestNN::test_affine_grid_error_checking, test/test_nn.py::TestNN::test_assignment, test/test_nn.py::TestNN::test_batch_norm_update_stats, test/test_nn.py::TestNN::test_batchnorm_buffer_update_when_stats_are_not_tracked, test/test_nn.py::TestNN::test_batchnorm_cudnn_half, test/test_nn.py::TestNN::test_batchnorm_cudnn_nhwc, test/test_nn.py::TestNN::test_batchnorm_half_overflow, 
test/test_nn.py::TestNN::test_batchnorm_load_state_dict, test/test_nn.py::TestNN::test_batchnorm_nhwc_cpu, test/test_nn.py::TestNN::test_batchnorm_nhwc_cuda, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_BatchNorm2d, test/test_nn.py::TestNN::test_batchnorm_non_contig_cpu_SyncBatchNorm, test/test_nn.py::TestNN::test_batchnorm_nonaffine_cuda_half_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_bias_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_less_than_one_value_per_channel, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_mean_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_is_not_same_size_as_input, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_running_var_or_running_mean_have_forward_grad, test/test_nn.py::TestNN::test_batchnorm_raises_error_if_weight_is_not_same_size_as_input, test/test_nn.py::TestNN::test_bce_loss_always_nonnegative, test/test_nn.py::TestNN::test_bce_loss_broadcasts_weights, test/test_nn.py::TestNN::test_bce_loss_input_range, test/test_nn.py::TestNN::test_bce_loss_size_mismatch, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_pos_weights, test/test_nn.py::TestNN::test_bce_with_logits_broadcasts_weights, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss, test/test_nn.py::TestNN::test_bce_with_logits_gives_same_result_as_sigmoid_and_bce_loss_large_tensors_with_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_forward_grad, test/test_nn.py::TestNN::test_bce_with_logits_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bce_with_logits_ones_in_pos_weights_are_the_same_as_none, test/test_nn.py::TestNN::test_bce_with_logits_raises_if_target_and_input_are_different_size, test/test_nn.py::TestNN::test_bce_with_logits_stability, test/test_nn.py::TestNN::test_bce_with_logits_with_pos_weight_has_correct_grad_at_zero, test/test_nn.py::TestNN::test_bilinear, 
test/test_nn.py::TestNN::test_bilinear_broadcasting, test/test_nn.py::TestNN::test_bilinear_no_bias, test/test_nn.py::TestNN::test_bilinear_non_contiguous, test/test_nn.py::TestNN::test_broadcast_double_backwards_gpu, test/test_nn.py::TestNN::test_broadcast_no_grad, test/test_nn.py::TestNN::test_broadcast_not_requiring_grad, test/test_nn.py::TestNN::test_buffer_bad_module_subclass, test/test_nn.py::TestNN::test_buffer_not_persistent, test/test_nn.py::TestNN::test_buffer_not_persistent_assign, test/test_nn.py::TestNN::test_buffer_not_persistent_del, test/test_nn.py::TestNN::test_buffer_not_persistent_load, test/test_nn.py::TestNN::test_buffer_not_persistent_overwrite, test/test_nn.py::TestNN::test_buffers_and_named_buffers, test/test_nn.py::TestNN::test_call_supports_python_dict_output, test/test_nn.py::TestNN::test_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_children, test/test_nn.py::TestNN::test_container_copy, test/test_nn.py::TestNN::test_convert_sync_batchnorm, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_diff_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_error_on_nonexpandable_shapes, test/test_nn.py::TestNN::test_cosine_embedding_loss_invalid_shape, test/test_nn.py::TestNN::test_cosine_embedding_loss_margin_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_no_reduce, test/test_nn.py::TestNN::test_cosine_embedding_loss_with_diff_type, test/test_nn.py::TestNN::test_cosine_similarity, test/test_nn.py::TestNN::test_cross_entropy_loss, test/test_nn.py::TestNN::test_cross_entropy_loss_precision, test/test_nn.py::TestNN::test_cross_entropy_loss_zero_div, test/test_nn.py::TestNN::test_cudnn_forward_exception, test/test_nn.py::TestNN::test_cudnn_rnn_dropout_states_device, test/test_nn.py::TestNN::test_cudnn_weight_format, test/test_nn.py::TestNN::test_cudnn_weight_tying, test/test_nn.py::TestNN::test_dir, test/test_nn.py::TestNN::test_dir_digit, test/test_nn.py::TestNN::test_elu_inplace_gradgrad, 
test/test_nn.py::TestNN::test_elu_inplace_on_view, test/test_nn.py::TestNN::test_error_RNN_seq_len_zero, test/test_nn.py::TestNN::test_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_get_extra_state, test/test_nn.py::TestNN::test_extra_state_missing_set_extra_state, test/test_nn.py::TestNN::test_extra_state_non_dict, test/test_nn.py::TestNN::test_fb_fc_packed, test/test_nn.py::TestNN::test_flatten, test/test_nn.py::TestNN::test_fold_invalid_arg, test/test_nn.py::TestNN::test_fractional_max_pool2d_invalid_output_ratio, test/test_nn.py::TestNN::test_gaussian_nll_loss_args, test/test_nn.py::TestNN::test_gaussian_nll_loss_broadcasting, test/test_nn.py::TestNN::test_get_buffer, test/test_nn.py::TestNN::test_get_buffer_from_submodules, test/test_nn.py::TestNN::test_getattr_with_property, test/test_nn.py::TestNN::test_grid_sample, test/test_nn.py::TestNN::test_grid_sample_3d, test/test_nn.py::TestNN::test_grid_sample_error_checking, test/test_nn.py::TestNN::test_grid_sample_nearest_neighbor_rounding_mode_consistency, test/test_nn.py::TestNN::test_hardtanh_backward, test/test_nn.py::TestNN::test_hardtanh_inplace_gradgrad, test/test_nn.py::TestNN::test_huber_loss_invalid_delta, test/test_nn.py::TestNN::test_inplace_thnn, test/test_nn.py::TestNN::test_interpolate, test/test_nn.py::TestNN::test_interpolate_bicubic_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bicubic_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners, 
test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bicubic_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_bilinear_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_shared_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_scale_tuple_skewed_2d_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_bilinear_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_buffer_overflow, test/test_nn.py::TestNN::test_interpolate_illegal_memory_access, test/test_nn.py::TestNN::test_interpolate_linear_1d, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_1d_cuda, 
test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_linear_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_linear_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d, test/test_nn.py::TestNN::test_interpolate_linear_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d, test/test_nn.py::TestNN::test_interpolate_nearest_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_1d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d, test/test_nn.py::TestNN::test_interpolate_nearest_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs, test/test_nn.py::TestNN::test_interpolate_nearest_2d_launch_configs_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_2d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d, test/test_nn.py::TestNN::test_interpolate_nearest_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_nearest_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d, test/test_nn.py::TestNN::test_interpolate_nearest_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_1d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d, 
test/test_nn.py::TestNN::test_interpolate_nearest_tuple_2d_cuda, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d, test/test_nn.py::TestNN::test_interpolate_nearest_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim, test/test_nn.py::TestNN::test_interpolate_trilinear_3d_zero_dim_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_scale_3d_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_align_corners_cuda, test/test_nn.py::TestNN::test_interpolate_trilinear_tuple_3d_cuda, test/test_nn.py::TestNN::test_interpolate_undefined_behavior_casting, test/test_nn.py::TestNN::test_kl_div_log_softmax_target, test/test_nn.py::TestNN::test_kl_div_with_diff_type, test/test_nn.py::TestNN::test_kl_div_with_diff_type_log_target, test/test_nn.py::TestNN::test_l1_loss_correct, test/test_nn.py::TestNN::test_layer_norm_eps, test/test_nn.py::TestNN::test_layer_norm_grads_with_create_graph_flag, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightCSR, 
test/test_nn.py::TestNN::test_linear_autograd_device_cpu_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_bias_weightStrided, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCOO, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSC, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightCSR, test/test_nn.py::TestNN::test_linear_autograd_device_cuda_nobias_weightStrided, test/test_nn.py::TestNN::test_linear_broadcasting, test/test_nn.py::TestNN::test_linear_raise_on_scalar_input, test/test_nn.py::TestNN::test_log_softmax_dim0, test/test_nn.py::TestNN::test_log_softmax_dim0_cuda, test/test_nn.py::TestNN::test_log_softmax_dim3, test/test_nn.py::TestNN::test_log_softmax_dim3_cuda, test/test_nn.py::TestNN::test_log_softmax_lastdim, test/test_nn.py::TestNN::test_log_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_log_softmax_scalar, test/test_nn.py::TestNN::test_log_softmax_scalar_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial, test/test_nn.py::TestNN::test_log_softmax_spatial_cuda, test/test_nn.py::TestNN::test_log_softmax_spatial_special, test/test_nn.py::TestNN::test_log_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_loss_equal_input_target_shape, test/test_nn.py::TestNN::test_margin_ranking_loss_margin_no_reduce, test/test_nn.py::TestNN::test_margin_ranking_loss_no_reduce, test/test_nn.py::TestNN::test_max_pool1d_invalid_output_size, test/test_nn.py::TestNN::test_module_apply_inplace_op, test/test_nn.py::TestNN::test_module_backcompat, test/test_nn.py::TestNN::test_module_super_init, test/test_nn.py::TestNN::test_module_to_argparse, test/test_nn.py::TestNN::test_modules, test/test_nn.py::TestNN::test_mse_loss_size_warning, 
test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce, test/test_nn.py::TestNN::test_multimarginloss_1d_input_0d_target_no_reduce_cuda, test/test_nn.py::TestNN::test_named_children, test/test_nn.py::TestNN::test_named_modules, test/test_nn.py::TestNN::test_named_parameters_remove_duplicate, test/test_nn.py::TestNN::test_native_channel_shuffle_return_alias_of_self, test/test_nn.py::TestNN::test_nested_tensor_from_mask, test/test_nn.py::TestNN::test_nested_tensor_from_mask_error, test/test_nn.py::TestNN::test_no_grad, test/test_nn.py::TestNN::test_non_leaf_parameters, test/test_nn.py::TestNN::test_normalize, test/test_nn.py::TestNN::test_overwrite_module_params_on_conversion, test/test_nn.py::TestNN::test_pack_sequence_batch_sizes_throw, test/test_nn.py::TestNN::test_pad_scalar_error, test/test_nn.py::TestNN::test_padding_list, test/test_nn.py::TestNN::test_pairwise_distance, test/test_nn.py::TestNN::test_parameter_assignment, test/test_nn.py::TestNN::test_parameterlistdict_pickle, test/test_nn.py::TestNN::test_parameterlistdict_setting_attributes, test/test_nn.py::TestNN::test_parameters_and_named_parameters, test/test_nn.py::TestNN::test_parameters_to_vector, test/test_nn.py::TestNN::test_parse_to, test/test_nn.py::TestNN::test_partial_flat_weights, test/test_nn.py::TestNN::test_pdist, test/test_nn.py::TestNN::test_pdist_cpu_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_cuda_gradgrad_unimplemented, test/test_nn.py::TestNN::test_pdist_empty_col, test/test_nn.py::TestNN::test_pdist_empty_row, test/test_nn.py::TestNN::test_pdist_large, test/test_nn.py::TestNN::test_pdist_zeros, test/test_nn.py::TestNN::test_pickle_module_no_weights_only_warning, test/test_nn.py::TestNN::test_pixel_shuffle_nhwc_cpu, test/test_nn.py::TestNN::test_pixel_shuffle_unshuffle, test/test_nn.py::TestNN::test_pointwise_loss_broadcast, test/test_nn.py::TestNN::test_pointwise_loss_target_grad_none_reduction, 
test/test_nn.py::TestNN::test_projections_errors_on_gru_and_rnn, test/test_nn.py::TestNN::test_projections_lstm_args_check, test/test_nn.py::TestNN::test_projections_lstm_check_device, test/test_nn.py::TestNN::test_projections_lstm_initial_hidden_state, test/test_nn.py::TestNN::test_register_buffer_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_register_buffer_raises_error_if_not_tensor, test/test_nn.py::TestNN::test_register_parameter_allows_overwriting_with_same_name, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_attr_exists, test/test_nn.py::TestNN::test_register_parameter_raises_error_if_name_is_not_string, test/test_nn.py::TestNN::test_relu_inplace_on_view, test/test_nn.py::TestNN::test_repr, test/test_nn.py::TestNN::test_requires_grad_, test/test_nn.py::TestNN::test_rnn_args_check, test/test_nn.py::TestNN::test_rnn_check_device, test/test_nn.py::TestNN::test_rnn_initial_hidden_state, test/test_nn.py::TestNN::test_rnn_weight_norm, test/test_nn.py::TestNN::test_set_submodule, test/test_nn.py::TestNN::test_share_memory, test/test_nn.py::TestNN::test_smoothl1loss_intergral_target, test/test_nn.py::TestNN::test_smoothl1loss_negative_beta_not_supported, test/test_nn.py::TestNN::test_softmax_functional_dim0, test/test_nn.py::TestNN::test_softmax_functional_dim0_cuda, test/test_nn.py::TestNN::test_softmax_functional_dim3, test/test_nn.py::TestNN::test_softmax_functional_dim3_cuda, test/test_nn.py::TestNN::test_softmax_functional_scalar, test/test_nn.py::TestNN::test_softmax_functional_scalar_cuda, test/test_nn.py::TestNN::test_softmax_lastdim, test/test_nn.py::TestNN::test_softmax_lastdim_cuda, test/test_nn.py::TestNN::test_softmax_lastdim_dtype, test/test_nn.py::TestNN::test_softmax_lastdim_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial, 
test/test_nn.py::TestNN::test_softmax_spatial_cuda, test/test_nn.py::TestNN::test_softmax_spatial_dtype, test/test_nn.py::TestNN::test_softmax_spatial_dtype_cuda, test/test_nn.py::TestNN::test_softmax_spatial_special, test/test_nn.py::TestNN::test_softmax_spatial_special_cuda, test/test_nn.py::TestNN::test_softmin, test/test_nn.py::TestNN::test_spectral_norm, test/test_nn.py::TestNN::test_spectral_norm_dim, test/test_nn.py::TestNN::test_spectral_norm_forward, test/test_nn.py::TestNN::test_spectral_norm_load_state_dict, test/test_nn.py::TestNN::test_spectral_norm_pickle, test/test_nn.py::TestNN::test_state_dict, test/test_nn.py::TestNN::test_swap_module_params_poisons_acc_grad, test/test_nn.py::TestNN::test_sync_batchnorm_accuracy_cuda, test/test_nn.py::TestNN::test_sync_batchnorm_backward_elemt, test/test_nn.py::TestNN::test_threshold_bfloat16_half, test/test_nn.py::TestNN::test_threshold_int, test/test_nn.py::TestNN::test_to, test/test_nn.py::TestNN::test_train_errors_for_invalid_mode, test/test_nn.py::TestNN::test_transformer_args_check, test/test_nn.py::TestNN::test_transformer_layer_args_check, test/test_nn.py::TestNN::test_transformerdecoder, test/test_nn.py::TestNN::test_transformerdecoderlayer, test/test_nn.py::TestNN::test_transformerdecoderlayer_gelu, test/test_nn.py::TestNN::test_triplet_margin_loss, test/test_nn.py::TestNN::test_triplet_margin_loss_no_reduce, test/test_nn.py::TestNN::test_triplet_margin_loss_swap, test/test_nn.py::TestNN::test_triplet_margin_loss_swap_no_reduce, test/test_nn.py::TestNN::test_type, test/test_nn.py::TestNN::test_unflatten, test/test_nn.py::TestNN::test_unflatten_invalid_arg, test/test_nn.py::TestNN::test_unfold_invalid_arg, test/test_nn.py::TestNN::test_upsamplingBilinear2d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingLinear1d, test/test_nn.py::TestNN::test_upsamplingLinear1d_spatial_invariance, test/test_nn.py::TestNN::test_upsamplingTrilinear3d_spatial_invariance, 
test/test_nn.py::TestNN::test_upsampling_bfloat16, test/test_nn.py::TestNN::test_upsampling_not_recompute_scale_factor, test/test_nn.py::TestNN::test_upsampling_small_scale, test/test_nn.py::TestNN::test_vector_to_parameters, test/test_nn.py::TestNN::test_weight_norm, test/test_nn.py::TestNN::test_weight_norm_pickle, test/test_nn.py::TestNN::test_weighted_huber_loss, test/test_nn.py::TestNN::test_weighted_l1_loss_with_weights, test/test_nn.py::TestNN::test_weighted_mse_loss, test/test_nn.py::TestNN::test_zero_grad, test/test_nn.py::TestFusionEval::test_fuse_module_eval_numerics, test/test_nn.py::TestConstantPadNd::test_constant_pad_nd, test/test_nn.py::TestConstantPadNd::test_preserves_memory_format, test/test_nn.py::TestAddRelu::test_add_relu, test/test_nn.py::TestAddRelu::test_add_relu_broadcasting, test/test_nn.py::TestFunctionalPickle::test_pickle_softsign, test/test_nn.py::TestFusionUtils::test_fuse_conv_bn_requires_grad, test/test_nn.py::TestFusionUtils::test_fuse_linear_bn_requires_grad, test/test_nn.py::TestUtils::test_consume_prefix_in_state_dict_if_present, test/test_nn.py::TestNNDeviceTypeCUDA::test_BatchNorm_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_Bilinear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_empty_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_mean_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_none_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_False_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_CTCLoss_no_batch_dim_reduction_sum_use_module_form_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GRU_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_memory_format_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_GroupNorm_raises_error_if_one_value_per_group_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm1d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm2d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_InstanceNorm3d_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_differentiable_backward_using_oneDNN_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_LSTM_grad_and_gradgrad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_general_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LayerNorm_numeric_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_LocalResponseNorm_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_MarginLoss_warnings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad2d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_complex64, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReflectionPad_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad1d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad2d_large_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad3d_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_ReplicationPad_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerDecoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoderLayer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_TransformerEncoder_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Transformer_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_Unfold_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_activations_bfloat16_half_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_adaptiveavg_pool1d_shmem_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate45_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotate90_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_2d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_affine_3d_rotateRandom_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_avg_pool_large_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_affine_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_cuda_float32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_eval_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_large_batch_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_simple_average_mixed_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_batchnorm_update_stats_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_channel_shuffle_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_error_if_nonfinite_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_False_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_0_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_1_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_2_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_4_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_foreach_True_norm_type_inf_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_False_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_norm_multi_device_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_clip_grad_value_foreach_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_conv_empty_input_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_64bit_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_consistent_index_target_and_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_errors_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_weight_ignore_indices_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_label_smoothing_with_probs_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_large_tensor_reduction_sum_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_2d_out_of_bounds_class_index_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_index_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_one_hot_target_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_all_reductions_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_mean_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_none_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_no_batch_dim_reduction_sum_weighted_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_cross_entropy_loss_prob_target_unit_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_ctc_loss_cudnn_tensor_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_device_mask_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_elu_inplace_with_neg_alpha_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_fold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_glu_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_bfloat16_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_2d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_large_index_3d_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_grid_sample_nan_inf_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_groupnorm_nhwc_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_gumbel_softmax_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardsigmoid_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_hardswish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_for_single_spatial_element_during_training_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm1d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_False_affine_True_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm2d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_False_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_False_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_input_channels_is_not_num_features_InstanceNorm3d_no_batch_dim_True_affine_True_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_instancenorm_raises_error_if_less_than_one_value_per_channel_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_invalid_reduction_strings_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_half_precision_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_layernorm_weight_bias_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_neg_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_leaky_relu_inplace_with_zero_slope_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_linear_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_big_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_log_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_logsigmoid_out_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_lstmcell_backward_only_one_output_grad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_TxT_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_devices_parity_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_forward_with_nans_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_grad_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_lowp_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_mask_types_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_masked_softmax_transformer_layout_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_mish_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_module_to_empty_non_recursive_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_all_ignored_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_byte_target_matches_long_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_mean_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_none_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_empty_tensor_reduction_sum_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_target_dim_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_invalid_weights_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_mean_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_none_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_large_tensor_reduction_sum_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_mismatched_batch_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_out_of_bounds_ignore_index_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nll_loss_total_weight_is_zero_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_empty_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nn_scalars_reductions_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_nonlinearity_propagate_nan_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_one_hot_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_overwrite_module_params_on_conversion_cpu_device_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_pad_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_prelu_backward_32bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_replicatepad_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_fused_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_rnn_retain_variables_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_save_lstm_compatibility_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_silu_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_skip_init_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smooth_l1_loss_vs_huber_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_smoothl1loss_backward_zero_beta_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_backward_64bit_indexing_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_bfloat16_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_bfloat16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cpu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_forward_64bit_indexing_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_softmax_results_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softplus_low_threshold_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_softshrink_negative_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_threshold_inplace_overlap_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda_complex128, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda_complex64, test/test_nn.py::TestNNDeviceTypeCUDA::test_to_complex_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_fast_path_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_transformerencoderlayer_gelu_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_triplet_margin_with_distance_loss_default_parity_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format0_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_False_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_399_output_size_437_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiLinear2d_consistency_interp_size_bug_memory_format1_align_corners_True_input_size_403_output_size_377_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format0_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_False_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_False_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bicubic_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_antialias_True_align_corners_True_mode_bilinear_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format0_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bicubic_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_False_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_False_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_3_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_32_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_False_non_contig_sliced_batch_size_5_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_False_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_restrided_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_consistency_memory_format1_mode_bilinear_antialias_True_align_corners_True_num_channels_5_output_size_600_check_as_unsqueezed_3d_tensor_True_non_contig_sliced_batch_size_5_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int16_cuda_int16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_float64_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float32_cuda_float32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_bilinear_uint8_cuda_uint8, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_int8_cuda_int8, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_False_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int64_cuda_int64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int32_cuda_int32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_3_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bicubic_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int16_cuda_int16, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_bilinear_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_float64_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest-exact_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float32_cuda_float32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_float64_cuda_float64, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int16_cuda_int16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int32_cuda_int32, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int64_cuda_int64, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_int8_cuda_int8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBiMode2d_nonsupported_dtypes_antialias_True_num_channels_5_mode_nearest_uint8_cuda_uint8, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBicubic2d_correctness_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingBilinear2d_aa_correctness_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest1d_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format0_isize_20_osize_11_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_fail_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_launch_rocm_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest2d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_launch_config_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format0_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest-exact_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearest3d_memory_format1_mode_nearest_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_10_osize_15_cuda, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_correctness_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact1d_rescale_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact2d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format0_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_10_osize_15_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingNearestExact3d_correctness_memory_format1_isize_20_osize_11_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_False_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format0_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingTrilinear3d_align_corners_True_memory_format1_cuda, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsampling_64bit_indexing_channels_last_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_upsamplingnearest2d_backward_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float32, 
test/test_nn.py::TestNNDeviceTypeCUDA::test_variable_sequence_cuda_float64, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float16, test/test_nn.py::TestNNDeviceTypeCUDA::test_warp_softmax_64bit_indexing_cuda_float32
2024-11-01T17:02:37.5161926Z
2024-11-01T17:02:37.5162338Z Running inductor/test_torchinductor 4/6 ... [2024-11-01 17:02:37.275170]
2024-11-01T17:02:37.5163076Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:02:37.5164396Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor.py', '-m', 'serial', '--shard-id=4', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:02:37.275547]
2024-11-01T17:02:47.9161053Z
2024-11-01T17:02:47.9162780Z inductor/test_torchinductor 4/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_4.6_709db2cd76cc7e59_.log
2024-11-01T17:02:47.9165926Z Running 0 items in this shard:
2024-11-01T17:02:47.9166262Z
2024-11-01T17:02:47.9166631Z Running test_ops 1/9 ... [2024-11-01 17:02:47.916126]
2024-11-01T17:02:47.9167142Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:02:47.9173278Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'serial', '--shard-id=1', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:02:47.916687]
2024-11-01T17:03:00.5627050Z
2024-11-01T17:03:00.5628738Z test_ops 1/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_1.9_8fc421cf3a9e07ac_.log
2024-11-01T17:03:00.5630049Z Running 0 items in this shard:
2024-11-01T17:03:00.5630471Z
2024-11-01T17:03:00.5630996Z Running test_ops 2/9 ... [2024-11-01 17:03:00.562601]
2024-11-01T17:03:00.5631875Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:03:00.5637606Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'serial', '--shard-id=2', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:03:00.563204]
2024-11-01T17:03:13.0575989Z
2024-11-01T17:03:13.0578434Z test_ops 2/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_2.9_8b20fe1383daf14f_.log
2024-11-01T17:03:13.0580181Z Running 0 items in this shard:
2024-11-01T17:03:13.0580705Z
2024-11-01T17:03:13.0581297Z Running test_ops 5/9 ... [2024-11-01 17:03:13.057513]
2024-11-01T17:03:13.0582229Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:03:13.0588573Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'serial', '--shard-id=5', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:03:13.058091]
2024-11-01T17:03:25.7031874Z
2024-11-01T17:03:25.7034386Z test_ops 5/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_5.9_f9c9cbdf89a1596c_.log
2024-11-01T17:03:25.7036417Z Running 0 items in this shard:
2024-11-01T17:03:25.7036956Z
2024-11-01T17:03:25.7037466Z Running test_ops 6/9 ... [2024-11-01 17:03:25.703101]
2024-11-01T17:03:25.7038394Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:03:25.7042388Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'serial', '--shard-id=6', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:03:25.703664]
2024-11-01T17:03:38.2480473Z
2024-11-01T17:03:38.2481467Z test_ops 6/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_6.9_600b267d47c68cfd_.log
2024-11-01T17:03:38.2485077Z Running 0 items in this shard:
2024-11-01T17:03:38.2485398Z
2024-11-01T17:03:38.2488923Z Running test_ops 9/9 ... [2024-11-01 17:03:38.248025]
2024-11-01T17:03:38.2489458Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:03:38.2490990Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'serial', '--shard-id=9', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:03:38.248390]
2024-11-01T17:03:50.7427897Z
2024-11-01T17:03:50.7429624Z test_ops 9/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_9.9_a7def968ad219401_.log
2024-11-01T17:03:50.7431186Z Running 0 items in this shard:
2024-11-01T17:03:50.7431663Z
2024-11-01T17:03:50.7432250Z Running test_binary_ufuncs 1/1 ... [2024-11-01 17:03:50.742680]
2024-11-01T17:03:50.7433206Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:03:50.7436958Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_binary_ufuncs.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:03:50.743317]
2024-11-01T17:03:58.7780755Z
2024-11-01T17:03:58.7783310Z test_binary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_binary_ufuncs_1.1_30dda148dc253a67_.log
2024-11-01T17:03:58.7785533Z Running 0 items in this shard:
2024-11-01T17:03:58.7786177Z
2024-11-01T17:03:58.7786910Z Running test_unary_ufuncs 1/1 ... [2024-11-01 17:03:58.777826]
2024-11-01T17:03:58.7788084Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:03:58.7791239Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_unary_ufuncs.py', '-m', 'serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:03:58.778425]
2024-11-01T17:04:09.6716797Z
2024-11-01T17:04:09.6718064Z test_unary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_unary_ufuncs_1.1_25983815dd7d64b7_.log
2024-11-01T17:04:09.6719625Z Running 0 items in this shard:
2024-11-01T17:04:09.6720217Z
2024-11-01T17:04:09.6787714Z Running test_ops 1/9 ... [2024-11-01 17:04:09.677509]
2024-11-01T17:04:09.6788949Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:04:09.6792526Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'not serial', '--shard-id=1', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:04:09.678150]
2024-11-01T17:04:09.6794191Z Running inductor/test_torchinductor 4/6 ... [2024-11-01 17:04:09.678608]
2024-11-01T17:04:09.6794782Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set
2024-11-01T17:04:09.6806923Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'inductor/test_torchinductor.py', '-m', 'not serial', '--shard-id=4', '--num-shards=6', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:04:09.680200]
2024-11-01T17:12:18.7499304Z
2024-11-01T17:12:18.7510200Z inductor/test_torchinductor 4/6 was successful, full logs can be found in artifacts with path test/test-reports/inductor.test_torchinductor_4.6_28417c6104091bb9_.log
2024-11-01T17:12:18.7586953Z Running 280 items in this shard: test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_broadcast2_dense, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_broadcast2_double, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_broadcast2_strided, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_broadcast3_broadcast1, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_broadcast3_broadcast3, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_dense_dense, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_int_int, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_strided_broadcast2, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_strided_double, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_transposed_broadcast2, test/inductor/test_torchinductor.py::SweepInputsCpuTest::test_cpu_transposed_transposed, test/inductor/test_torchinductor.py::CpuTests::test_AllenaiLongformerBase_repro_cpu, test/inductor/test_torchinductor.py::CpuTests::test_add_complex_cpu, test/inductor/test_torchinductor.py::CpuTests::test_aoti_eager_dtype_device_layout_cpu, test/inductor/test_torchinductor.py::CpuTests::test_aoti_eager_with_scalar_cpu, test/inductor/test_torchinductor.py::CpuTests::test_arange4_cpu, test/inductor/test_torchinductor.py::CpuTests::test_argmax_argmin3_cpu, test/inductor/test_torchinductor.py::CpuTests::test_argmax_argmin_with_duplicates_cpu, test/inductor/test_torchinductor.py::CpuTests::test_avg_pool2d2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_avg_pool3d_backward3_cpu,
test/inductor/test_torchinductor.py::CpuTests::test_bernoulli2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_bitwise_cpu, test/inductor/test_torchinductor.py::CpuTests::test_bucketize_default_kwargs_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cat_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cat_extern_kernel_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cat_uint8_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cat_unbacked_empty_1d_cpu, test/inductor/test_torchinductor.py::CpuTests::test_clamp_cpu, test/inductor/test_torchinductor.py::CpuTests::test_constant_pad_3d_cpu, test/inductor/test_torchinductor.py::CpuTests::test_conv2d_channels_last_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cudnn_rnn_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cumprod_zero_dim_cpu, test/inductor/test_torchinductor.py::CpuTests::test_cumsum_cpu, test/inductor/test_torchinductor.py::CpuTests::test_div3_cpu, test/inductor/test_torchinductor.py::CpuTests::test_div6_cpu, test/inductor/test_torchinductor.py::CpuTests::test_div9_cpu, test/inductor/test_torchinductor.py::CpuTests::test_div_zero_dim_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtype_sympy_expr_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_float16_float16_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_float32_int16_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_float32_uint8_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_float64_float32_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int16_bfloat16_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int16_float32_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int16_float64_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int16_int8_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int32_int8_cpu, 
test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int64_float32_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int64_int32_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_int8_float64_cpu, test/inductor/test_torchinductor.py::CpuTests::test_dtypeview_uint8_int8_cpu, test/inductor/test_torchinductor.py::CpuTests::test_expand_cpu, test/inductor/test_torchinductor.py::CpuTests::test_fallback_mutable_op_basic_cpu, test/inductor/test_torchinductor.py::CpuTests::test_fallback_mutable_op_no_mutated_tensors_cpu, test/inductor/test_torchinductor.py::CpuTests::test_fill2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_flip_cat_cpu, test/inductor/test_torchinductor.py::CpuTests::test_fractional_max_pool2d2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_fractional_max_pool2d3_cpu, test/inductor/test_torchinductor.py::CpuTests::test_grid_sampler_2d_cpu, test/inductor/test_torchinductor.py::CpuTests::test_index1_cpu, test/inductor/test_torchinductor.py::CpuTests::test_index_propagation_cpu, test/inductor/test_torchinductor.py::CpuTests::test_index_put_as_masked_fill_cpu, test/inductor/test_torchinductor.py::CpuTests::test_inf_cpu, test/inductor/test_torchinductor.py::CpuTests::test_inner_fn_str_and_stride_cpu, test/inductor/test_torchinductor.py::CpuTests::test_input_mutation3_cpu, test/inductor/test_torchinductor.py::CpuTests::test_input_mutation4_cpu, test/inductor/test_torchinductor.py::CpuTests::test_int_input_dynamic_shapes_cpu, test/inductor/test_torchinductor.py::CpuTests::test_kernel_names_cpu, test/inductor/test_torchinductor.py::CpuTests::test_large_grid_cpu, test/inductor/test_torchinductor.py::CpuTests::test_large_pointwise_cpu, test/inductor/test_torchinductor.py::CpuTests::test_large_tensor_reduction_cpu, test/inductor/test_torchinductor.py::CpuTests::test_linear2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_linspace1_cpu, 
test/inductor/test_torchinductor.py::CpuTests::test_max_pool2d7_cpu, test/inductor/test_torchinductor.py::CpuTests::test_mm_mixed_dtype_cpu, test/inductor/test_torchinductor.py::CpuTests::test_move_arange_cpu, test/inductor/test_torchinductor.py::CpuTests::test_multilayer_var_cpu, test/inductor/test_torchinductor.py::CpuTests::test_mutable_custom_op_fixed_layout2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_new_ones_cpu, test/inductor/test_torchinductor.py::CpuTests::test_nll_loss_backward_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_chebyshev_polynomial_v_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_erfinv_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_laguerre_polynomial_l_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_log1p_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_modified_bessel_i0_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_polygamma_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_scaled_modified_bessel_k1_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pointwise_shifted_chebyshev_polynomial_u_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pow1_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pow2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pow_int_cpu, test/inductor/test_torchinductor.py::CpuTests::test_pow_symfloat_cpu, test/inductor/test_torchinductor.py::CpuTests::test_randint_kernel_count_cpu, test/inductor/test_torchinductor.py::CpuTests::test_randn_with_dtype_and_device_cpu, test/inductor/test_torchinductor.py::CpuTests::test_reduction5_cpu, test/inductor/test_torchinductor.py::CpuTests::test_reduction_config_limit_cpu, test/inductor/test_torchinductor.py::CpuTests::test_remainder_cpu, test/inductor/test_torchinductor.py::CpuTests::test_repeat_interleave_2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_require_stride_expanded_cpu, 
test/inductor/test_torchinductor.py::CpuTests::test_scalar_cpu_tensor_arg_cpu, test/inductor/test_torchinductor.py::CpuTests::test_scatter4_cpu, test/inductor/test_torchinductor.py::CpuTests::test_sdpa_unaligned_mask_freezing_cpu, test/inductor/test_torchinductor.py::CpuTests::test_sin_cpu, test/inductor/test_torchinductor.py::CpuTests::test_single_elem_cpu, test/inductor/test_torchinductor.py::CpuTests::test_sizehint_issue1_cpu, test/inductor/test_torchinductor.py::CpuTests::test_slice_mutation2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_slice_scatter_cpu, test/inductor/test_torchinductor.py::CpuTests::test_softmax_one_kernel_persist_cpu, test/inductor/test_torchinductor.py::CpuTests::test_sort_transpose_cpu, test/inductor/test_torchinductor.py::CpuTests::test_split_cumprod_low_prec_cpu, test/inductor/test_torchinductor.py::CpuTests::test_stride_preservation_with_stride_modifying_fx_pass_cpu, test/inductor/test_torchinductor.py::CpuTests::test_sum2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_tan_cpu, test/inductor/test_torchinductor.py::CpuTests::test_tensor2_cpu, test/inductor/test_torchinductor.py::CpuTests::test_tensor3_cpu, test/inductor/test_torchinductor.py::CpuTests::test_tmp_not_defined_issue1_cpu, test/inductor/test_torchinductor.py::CpuTests::test_unspec_inputs_bfloat16_cpu, test/inductor/test_torchinductor.py::CpuTests::test_unspec_inputs_float16_cpu, test/inductor/test_torchinductor.py::CpuTests::test_upsample_bilinear2d_a_cpu, test/inductor/test_torchinductor.py::CpuTests::test_upsample_bilinear2d_b_cpu, test/inductor/test_torchinductor.py::CpuTests::test_upsample_cat_conv_cpu, test/inductor/test_torchinductor.py::CpuTests::test_vectorized_ops_masked_cpu, test/inductor/test_torchinductor.py::CpuTests::test_vertical_fusion1_cpu, test/inductor/test_torchinductor.py::CpuTests::test_views4_cpu, test/inductor/test_torchinductor.py::CpuTests::test_zero_dim_reductions_cpu, 
test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast1_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast2_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_broadcast3, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_broadcast3_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_double_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_dense, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_int_double, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_broadcast1, test/inductor/test_torchinductor.py::SweepInputsGPUTest::test_cuda_transposed_dense, test/inductor/test_torchinductor.py::GPUTests::test__unsafe_masked_index_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_avg_pool2d_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adaptive_max_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_adding_tensor_offsets_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_dtype_device_layout_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_support_out_cuda, test/inductor/test_torchinductor.py::GPUTests::test_aoti_eager_support_str_cuda, test/inductor/test_torchinductor.py::GPUTests::test_argmax_min_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_as_strided_scatter_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward3_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_avg_pool2d_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_avg_pool3d_backward3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bernoulli1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_bmm1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_batch_norm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_buffer_use_after_remove_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_pos_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_float_ndigits_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_builtins_round_int_ndigits_zero_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_extern_kernel_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_negative_dim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cat_unbacked_2d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_consecutive_split_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_constant_pad_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_convolution2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_cuda, test/inductor/test_torchinductor.py::GPUTests::test_cumsum_no_mask_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_op_2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_scan_op_multi_input_cuda, test/inductor/test_torchinductor.py::GPUTests::test_custom_scan_would_split_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dist_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div5_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_div_prim_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dropout_deterministic_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtype_sympy_expr_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_bfloat16_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float16_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float32_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_float64_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int16_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int32_int8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int64_int64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_int8_int32_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_dtypeview_uint8_float64_cuda, test/inductor/test_torchinductor.py::GPUTests::test_empty1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_flip_cuda, test/inductor/test_torchinductor.py::GPUTests::test_forced_buffer_realize_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d2_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_fractional_max_pool2d4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_getitem_cuda, test/inductor/test_torchinductor.py::GPUTests::test_hardsigmoid_cuda, test/inductor/test_torchinductor.py::GPUTests::test_horizonal_fusion2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_index_put_deterministic_fallback_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inf_cuda, test/inductor/test_torchinductor.py::GPUTests::test_inner_fn_str_and_stride_cuda, test/inductor/test_torchinductor.py::GPUTests::test_input_mutation3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_issue102546_cuda, test/inductor/test_torchinductor.py::GPUTests::test_l1_loss_cuda, test/inductor/test_torchinductor.py::GPUTests::test_large_broadcast_reduction_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_linspace2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_log_softmax_cuda, test/inductor/test_torchinductor.py::GPUTests::test_low_memory_max_pool_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_min_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward4_cuda, test/inductor/test_torchinductor.py::GPUTests::test_max_pool2d_with_indices_backward6_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mixed_mm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mm_views_cuda, test/inductor/test_torchinductor.py::GPUTests::test_multilayer_var_cuda, test/inductor/test_torchinductor.py::GPUTests::test_mutable_custom_op_fixed_layout2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_neg_index_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_neg_max_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nll_loss_backward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_nll_loss_forward_cuda, test/inductor/test_torchinductor.py::GPUTests::test_no_mega_fusion_during_lowering_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j0_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_bessel_j1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_u_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_chebyshev_polynomial_v_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_digamma_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_gammaincc_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_hermite_polynomial_he_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_shifted_chebyshev_polynomial_u_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pointwise_xlogy_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_pow_int_cuda, test/inductor/test_torchinductor.py::GPUTests::test_randint_int64_mod_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reduction3_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reinterpret_dtypeview_cuda, test/inductor/test_torchinductor.py::GPUTests::test_relu_cuda, test/inductor/test_torchinductor.py::GPUTests::test_reuse_buffers_with_aliasing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_roll_cuda, test/inductor/test_torchinductor.py::GPUTests::test_rsqrt_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scalar_output_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scaled_dot_product_attention_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_scatter_reduce1_cuda, 
test/inductor/test_torchinductor.py::GPUTests::test_sdpa_prefer_nd_tiling_True_use_block_ptr_False_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sdpa_unaligned_mask_freezing_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sgn_extremal_cuda, test/inductor/test_torchinductor.py::GPUTests::test_shape_padding_cuda, test/inductor/test_torchinductor.py::GPUTests::test_should_pad_bench_for_bmm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sigmoid_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sin_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_mutation1_cuda, test/inductor/test_torchinductor.py::GPUTests::test_slice_scatter2_cuda, test/inductor/test_torchinductor.py::GPUTests::test_sort_bool_cuda, test/inductor/test_torchinductor.py::GPUTests::test_split_cumprod_low_prec_cuda, test/inductor/test_torchinductor.py::GPUTests::test_squeeze_varargs_cuda, test/inductor/test_torchinductor.py::GPUTests::test_transposed_propagates_cuda, test/inductor/test_torchinductor.py::GPUTests::test_uint4x2_mixed_mm_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_bfloat16_cuda, test/inductor/test_torchinductor.py::GPUTests::test_unspec_inputs_uint8_cuda, test/inductor/test_torchinductor.py::GPUTests::test_upsample_nearest3d_cuda, test/inductor/test_torchinductor.py::GPUTests::test_vectorized_ops_masked_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_detach_cuda, test/inductor/test_torchinductor.py::GPUTests::test_view_uint8_through_differing_bitwidths_cuda, test/inductor/test_torchinductor.py::GPUTests::test_views5_cuda, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_cant_optimize_compute, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_ctr_not_moved_to_cuda_when_used_in_index_put, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_evict_last_non_coalesced_loads_block_ptr, 
test/inductor/test_torchinductor.py::TritonCodeGenTests::test_not_materialize_pointwise_reduction, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_assert, test/inductor/test_torchinductor.py::TritonCodeGenTests::test_optimize_indexing_dtype_with_constraint, test/inductor/test_torchinductor.py::NanCheckerTest::test_nan_checker_pass 2024-11-01T17:12:18.7661967Z 2024-11-01T17:12:18.7662218Z Running test_ops 2/9 ... [2024-11-01 17:12:18.751985] 2024-11-01T17:12:18.7662625Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:12:18.7663860Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'not serial', '--shard-id=2', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:12:18.752377] 2024-11-01T17:12:18.7665066Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or ARTIFACTS_FILE_SUFFIX not set, not uploading 2024-11-01T17:12:18.7665562Z Uploading artifacts took 0.00 seconds 2024-11-01T17:13:20.0915694Z 2024-11-01T17:13:20.0921997Z test_ops 1/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_1.9_cbd2293ca8441106_.log 2024-11-01T17:13:20.2154594Z Running 3685 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_cauchy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_det_singular_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_constant_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_var_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_angle_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nansum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_slice_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_transpose_copy_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unsafe_split_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes_T_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___ror___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_any_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clone_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flip_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_frac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isnan_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_list_of_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_randn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_logit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_xlogy_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addbmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cummax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_equal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_eye_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_float_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_frac_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_inner_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_istft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eig_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_factor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log1p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_and_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_long_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_linear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_constant_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_scaled_dot_product_attention_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_slice_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_slice_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_modified_bessel_k1_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_special_shifted_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_cuda, test/test_ops.py::TestCommonCUDA::test_errors_T_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_errors_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diag_cuda, test/test_ops.py::TestCommonCUDA::test_errors_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_errors_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_errors_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_errors_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_errors_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_mean_cuda, test/test_ops.py::TestCommonCUDA::test_errors_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_polar_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout0_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_u_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_errors_unbind_cuda, test/test_ops.py::TestCommonCUDA::test_errors_view_cuda, test/test_ops.py::TestCommonCUDA::test_errors_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_vecdot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_chebyshev_polynomial_v_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmod___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagflat_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_permuted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gather_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lcm_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_dropout_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapz_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rand___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rxor___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fmin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_full_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_put_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_kron_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mH_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ones_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_take_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tril_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___radd___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmatmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amax_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cov_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_constant_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_selu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sgn_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_h_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_square_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_hann_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_T_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_nll_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_flipud_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_histc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_fro_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_floor_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_inner_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_inv_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_5_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sparse_sampled_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_y1_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unravel_index_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning_T_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___getitem___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_asin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flip_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_i0_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_item_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sinc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sum_to_size_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tensor_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_var_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clone_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_combinations_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_einsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_equal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_histc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_inner_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_int_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_4inputs_with_extra_args_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cholesky_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_det_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lstsq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_factor_ex_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_singular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mH_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_median_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanmedian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_native_dropout_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_binary_cross_entropy_with_logits_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_grid_sample_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardswish_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_linear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_silu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pca_lowrank_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_permute_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resize__cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resize_as__cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resolve_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rsqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_i1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_stft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_to_sparse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_transpose_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trapezoid_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_tril_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unfold_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_view_as_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_zero__cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmin_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gcd_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_imag_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lcm_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_normal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lcm_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rsub___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_to_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_einsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geqrf_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vander_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cosine_embedding_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_negative_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_softsign_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_nuc_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_qr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_xlog1py_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensordot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bfloat16_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_max_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfftn_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_channel_shuffle_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_zero__cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cauchy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmax_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isposinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_var_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_minimum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_partial_views_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_combinations_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_inner_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_ldl_solve_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_grad_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_j1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_polygamma_special_polygamma_n_0_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_true_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erfc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expm1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_floor_divide_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_xor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_1_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsafe_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_vsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_shapes_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_eigh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_matmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nansum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_zeta_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unique_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___getitem___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_acos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_exp2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_imag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_index_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_istft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logsumexp_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_mul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_pow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_renorm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_allclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_angle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_conj_physical_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diag_embed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_hsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_put_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isreal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_kron_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eig_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vecdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logcumsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_or_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_normalize_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_var_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_narrow_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_inf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_qr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ravel_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reciprocal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rsqrt_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_svd_lowrank_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zeros_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_H_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atleast_2d_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_flip_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_float_power_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_permute_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_ravel_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_rot90_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_rsub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sigmoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_tensor_split_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unbind_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_view_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_where_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addbmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_bfloat16_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_broadcast_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_chalf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_inverse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_embed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_einsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expm1_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eye_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_istft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lerp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_cholesky_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_ldl_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lstsq_grad_oriented_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_or_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumprod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_feature_alpha_dropout_without_train_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_normalize_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_replicate_negative_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nonzero_static_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pca_lowrank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_qr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resize__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_with_sizes_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_unbiased_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tensordot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_triu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmod___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_cdouble_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_polar_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_short_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_acosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_conj_physical_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cumsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expand_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_irfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isneginf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_lt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_pixel_shuffle_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_permute_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_repeat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_xlog1py_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_where_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_xlogy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__segment_reduce_lengths_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__unsafe_masked_index_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_acosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_all_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_arange_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_chalf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cholesky_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_clamp_max_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_constant_pad_nd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_corrcoef_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cummax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_geometric_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_inner_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isposinf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isreal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvalsh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_qr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_unpack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanmean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_max_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_avg_pool3d_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_similarity_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_elu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_gaussian_nll_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_gelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_nearest_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_kl_div_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_prelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_silu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_real_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_remainder_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_round_decimals_neg_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_blackman_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_kaiser_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_slice_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sparse_sampled_addmm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_y1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i0e_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_split_with_sizes_copy_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_squeeze_multiple_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_stft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_transpose_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unique_consecutive_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_aminmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eq_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_eye_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_igamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_binary_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_smooth_l1_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_in_place_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randint_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triu_indices_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cov_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_float_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_embedding_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sparse_sampled_addmm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_kthvalue_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_matmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardtanh_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_pinverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_real_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_mean_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_pool2d_with_indices_backward_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_multinomial_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_sampled_addmm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_transpose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_digamma_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gcd_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eigvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_long_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose2d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polar_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randint_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sgn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_slice_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_entr_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int8, test/test_ops.py::TestTagsCUDA::test_tags_H_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rand___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_bool_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_addcmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_cauchy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_min_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_clone_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flip_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_frexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_geometric_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_heaviside_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_imag_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags__refs_isreal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lgamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ne_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_zeros_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_pow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_real_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_std_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_where_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addr_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_aminmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_or_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_block_diag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bucketize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cholesky_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_column_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_count_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diag_embed_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_eye_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_floor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_igammac_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_put_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_amin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_kron_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_det_singular_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_slogdet_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_softmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matmul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_max_binary_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_empty_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_gelu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nonzero_static_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rad2deg_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resolve_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rot90_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_slice_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_j1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_zeta_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sum_to_size_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_transpose_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_vsplit_cuda_float32 2024-11-01T17:13:20.3258391Z 2024-11-01T17:13:20.3258688Z Running test_ops 5/9 ... 
[2024-11-01 17:13:20.100274] 2024-11-01T17:13:20.3259095Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:13:20.3260359Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'not serial', '--shard-id=5', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:13:20.100856] 2024-11-01T17:20:27.6746086Z 2024-11-01T17:20:27.6748115Z test_ops 2/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_2.9_777694d836aa994e_.log 2024-11-01T17:20:27.8067424Z Running 3736 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rand___cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_poisson_nll_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool2d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_alias_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_copy_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_partial_views_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_asinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_permute_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_int_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_partial_views_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expand_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_expand_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_not_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_selu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sqrt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_t_copy_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__upsample_bilinear2d_aa_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_arange_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argsort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_chalf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cholesky_inverse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cummin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flip_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_floor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_grid_sampler_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_put_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cholesky_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cond_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lstsq_grad_oriented_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_lu_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vander_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_reduction_no_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_movedim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_msort_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_batch_norm_without_cudnn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv_transpose3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_cosine_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_silu_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_roll_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_hann_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sparse_mm_reduce_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i0e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__efficient_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tril_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_real_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_errors___rxor___cuda, test/test_ops.py::TestCommonCUDA::test_errors_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diff_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dot_cuda, test/test_ops.py::TestCommonCUDA::test_errors_eq_cuda, test/test_ops.py::TestCommonCUDA::test_errors_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_errors_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_median_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_gaussian_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_softshrink_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_errors_rot90_cuda, test/test_ops.py::TestCommonCUDA::test_errors_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_chebyshev_polynomial_u_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_floor_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_erfcx_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rsub___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_combinations_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cov_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_digamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_geqrf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_item_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_ldl_solve_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_fractional_max_pool3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softsign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scalar_tensor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_prod_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_xlog1py_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_alias_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_clamp_max_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_combinations_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_jiterator_2inputs_2outputs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log1p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_mT_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_min_reduction_no_dim_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_softsign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_slice_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_y0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_scaled_modified_bessel_k1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_zeros_like_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addbmm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_erfc_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gradient_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_amin_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kron_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cond_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vander_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vector_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mT_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_no_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_tanhshrink_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_fro_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_searchsorted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y0_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_addbmm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_rms_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_repeat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_searchsorted_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_blackman_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_char_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nextafter_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_argwhere_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_copy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_integral_dtype__refs_prod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_out_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_batch_norm_without_cudnn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cumsum_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_fro_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_list_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_triangular_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_warning___rmod___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_double_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_abs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expand_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_expand_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_floor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_channel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softshrink_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_round_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addbmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_angle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argsort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atleast_3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cfloat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expm1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_float_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_geqrf_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_hstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_imag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_istft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_item_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_le_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_ldl_factor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorinv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_tensorsolve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logdet_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lt_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_unpack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_binary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mul_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_embedding_bag_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_embedding_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hardsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool3d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_silu_complex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ormqr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polar_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_quantile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randn_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_sum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_select_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_softmax_with_dtype_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_sparse_sampled_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_v_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_shifted_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_square_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_take_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triangular_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unique_consecutive_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float___rdiv___cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_ldexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_legendre_polynomial_p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_frac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_huber_loss_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_clamp_min_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ifft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_all_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isfinite_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_istft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svd_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_alpha_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_leaky_relu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pdist_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_randn_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lcm_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exponential_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pdist_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int8, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_any_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_block_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cov_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_full_like_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isreal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_circular_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ones_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_blackman_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_svd_lowrank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triangular_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zero__cuda_complex64, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cumulative_trapezoid_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expand_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_max_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_vstack_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addmm_decomposed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_allclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argwhere_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_full_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_geqrf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_le_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eigvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logical_or_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_median_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sgn_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clamp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_clone_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_copy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_narrow_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_avg_pool2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_nuttall_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sum_to_size_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zero__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_argmin_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_inv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nansum_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_normal_number_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scalar_tensor_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_allclose_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_as_strided_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_float_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_floor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_norm_subgradients_at_zero_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_not_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_feature_alpha_dropout_without_train_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_roll_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_short_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_squeeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_stft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_svd_lowrank_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_where_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view___radd___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_float_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_contiguous_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cumprod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_irfftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_hstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isfinite_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log2_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_roll_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tensor_split_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_acosh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addbmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmm_decomposed_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_asin_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_block_diag_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_bool_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_column_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_exp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_eye_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_det_singular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigh_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_inv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_tensorsolve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mH_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_prod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_sum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nansum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_unshuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_triplet_margin_with_distance_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_static_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ones_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ormqr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pinverse_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_randn_like_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sqrt_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sum_to_size_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_transpose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_uniform_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmatmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_byte_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cdouble_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_acos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_acosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_addcmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_atanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diag_embed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_eq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_hsplit_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isinf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_t_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_to_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcdiv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_angle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cholesky_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_count_nonzero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_double_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_permuted_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_hfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_flipud_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_det_singular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eigvalsh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_hermitian_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vecdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_log10_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_long_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nansum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_linear_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_constant_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_inf_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_nuc_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_permute_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_permute_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resolve_conj_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_rsub_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sigmoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_unbiased_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_t_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_tan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_trapezoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_var_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rmul___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view___rsub___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__batch_norm_with_update_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__native_batch_norm_legit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_bool_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_char_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_alias_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_broadcast_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_conj_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cosh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_dsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_like_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_exponential_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eye_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_ge_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_igammac_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isfinite_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_and_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_xor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_masked_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_rsqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_std_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unbind_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_view_copy_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addmv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_angle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atleast_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_broadcast_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cauchy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_complex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cov_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagflat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dstack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_hfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_half_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isneginf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ldexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cond_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lstsq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_factor_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_singular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logcumsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_reduction_with_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_narrow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_empty_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardswish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bilinear_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_renorm_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_reshape_as_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_roll_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_prod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_short_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_general_cosine_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_hamming_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_std_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_take_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_trapezoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unbind_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_as_complex_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_arange_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_inverse_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_long_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsqueeze_copy_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_right_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fftshift_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_det_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_inv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mv_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_glu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_roll_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_inverse_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lstsq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_tensorsolve_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_layer_norm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_selu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amin_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_erfcx_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_xlogy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_eq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_float_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_solve_triangular_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_xor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_new_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_inf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rsub_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_entr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unravel_index_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___getitem___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___ror___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cauchy_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_contiguous_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_eq_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_unary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log1p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_tensor_overload_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ravel_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reshape_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tril_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zeros_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_complex64, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int16, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_float_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_long_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bucketize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_conj_physical_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_div_trunc_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_dsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_fftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft2_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hstack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_le_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vecdot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_and_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_permute_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_repeat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_reshape_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tensor_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_alias_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_all_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atleast_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clamp_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_clamp_max_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_equal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_igamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_amax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_int_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_isneginf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_inv_ex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_median_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mul_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ne_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_neg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_new_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv3d_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_selu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softsign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_outer_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pinverse_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_hamming_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_entr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_square_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tensor_split_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_tril_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_unfold_copy_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_unravel_index_cuda_int64 2024-11-01T17:20:27.9168315Z 2024-11-01T17:20:27.9168609Z Running test_ops 6/9 ... [2024-11-01 17:20:27.683587] 2024-11-01T17:20:27.9169006Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:20:27.9170222Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'not serial', '--shard-id=6', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:20:27.684182] 2024-11-01T17:21:36.6376346Z 2024-11-01T17:21:36.6381895Z test_ops 5/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_5.9_0926987bb1868889_.log 2024-11-01T17:21:36.7773569Z Running 3712 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_cfloat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_masked_fill_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_margin_ranking_loss_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing__chunk_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_acosh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cfloat_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_fftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_lerp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_narrow_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sub_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_unbind_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_zeros_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___radd___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rdiv___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes___rsub___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__native_batch_norm_legit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_bfloat16_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_half_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_long_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_acosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_allclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_atanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_block_diag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_deg2rad_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_equal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_ge_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_istft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logaddexp2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_alpha_dropout_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_permute_copy_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_real_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_entr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tril_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atleast_2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cos_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_rfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_full_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histogram_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histogramdd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_i0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_int_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_lcm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_det_singular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_eigvalsh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_power_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_rank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_norm_subgradients_at_zero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nanquantile_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_ctc_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_nearest-exact_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_trilinear_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_number_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_permute_copy_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_permute_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_put_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ravel_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_resize__cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_round_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_round_decimals_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scalar_tensor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_short_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_split_with_sizes_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_stack_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_torch__scaled_mm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_triangular_solve_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_uniform_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unique_consecutive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unravel_index_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_zeros_like_cuda, test/test_ops.py::TestCommonCUDA::test_errors___radd___cuda, test/test_ops.py::TestCommonCUDA::test_errors__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_amax_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_bitwise_left_shift_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cat_cuda, test/test_ops.py::TestCommonCUDA::test_errors_cauchy_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_no_rounding_mode_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_trunc_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fmod_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gt_cuda, test/test_ops.py::TestCommonCUDA::test_errors_index_select_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_errors_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_errors_ne_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_avg_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_multilabel_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_reshape_as_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout1_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout4_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout1_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_errors_t_cuda, test/test_ops.py::TestCommonCUDA::test_errors_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lu_solve_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tensordot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_where_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___getitem___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_argwhere_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_char_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_put_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_svd_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nanmedian_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_neg_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_constant_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_reflect_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_with_distance_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ones_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_pow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_scaled_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_sparse_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___ror___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_amax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_chalf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_diag_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_empty_permuted_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_fft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_gather_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_masked_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_meshgrid_variadic_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nan_to_num_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_polygamma_polygamma_n_1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_repeat_interleave_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_slice_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_bessel_y1_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_legendre_polynomial_p_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_torch_ops_aten__safe_softmax_default_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmul___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__unsafe_masked_index_put_accumulate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bincount_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bucketize_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chalf_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_corrcoef_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dist_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_full_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_heaviside_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_histc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_householder_product_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_vecdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mT_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logsumexp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_reduction_with_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_full_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_margin_ranking_loss_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_circular_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_rms_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nonzero_static_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_fro_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_4_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_airy_ai_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_k0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_list_args_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_to_size_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_2inputs_2outputs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorinv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_rms_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_gaussian_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_hamming_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_kaiser_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_nuttall_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_out___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out___rxor___cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_bool_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_block_diag_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_index_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_istft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_selu_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cauchy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_ex_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_celu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_inverse_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_householder_product_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log2_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_nuc_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_split_with_sizes_copy_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unique_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___radd___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rmatmul___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning___rxor___cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__chunk_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_T_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_bool_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_byte_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_float_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_atan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clamp_max_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_column_stack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_flipud_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_lgamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_mish_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_threshold_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_randn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_square_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_t_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__segment_reduce_lengths_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addmv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_argmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bernoulli_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ceil_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_char_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_clamp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cummax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_expand_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifft2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_geometric_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lerp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eig_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_lu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_solve_triangular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_std_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mvlgamma_mvlgamma_p_5_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nanmean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv_transpose1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_dropout2d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_local_response_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_normalize_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_reflect_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_relu6_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_4_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_put_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_qr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_randint_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_gaussian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_laguerre_polynomial_l_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tile_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_var_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_reciprocal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_u_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_like_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_geometric_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_minimum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_glu_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs__conversions_complex_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_float_power_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fmax_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_le_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logspace_tensor_overload_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_roll_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_complex_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_polar_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_add_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cauchy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expm1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fill_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hypot_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_le_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_celu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_group_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mse_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pdist_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_renorm_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_roll_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcmul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_alias_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_ones_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardtanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_l1_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mse_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_unshuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_partial_views_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_heaviside_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_list_of_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_minimum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_glu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_tanhshrink_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_split_with_sizes_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_xlogy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___getitem___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmatmul___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__unsafe_masked_index_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_all_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_aminmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_broadcast_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cdouble_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_combinations_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eye_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_geqrf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_unary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lerp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lstsq_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_lu_factor_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lu_unpack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv_transpose3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pca_lowrank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_copy_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_to_sparse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_zeros_like_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_abs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_addr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_bernoulli_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cdouble_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_float_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_gather_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_softmax_with_dtype_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mH_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_batch_norm_without_cudnn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_inf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_repeat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_ndtr_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___getitem___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_diagonal_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_floor_divide_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_ge_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_vecdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log10_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_median_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_maximum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_new_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_repeat_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signbit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_square_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_tan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addbmm_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_any_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_argsort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_asin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ceil_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_einsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_equal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_eye_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_constant_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reshape_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_take_along_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tensordot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_uniform_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unique_consecutive_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_T_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__softmax_backward_data_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_shapes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_clamp_min_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cummax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_geqrf_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_pool2d_with_indices_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_native_dropout_backward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_remainder_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_laguerre_polynomial_l_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_topk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_view_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_asinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_chunk_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_column_stack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_constant_pad_nd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_count_nonzero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cov_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_int_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_item_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_power_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_slogdet_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mul_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rrelu_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_silu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nonzero_static_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_positive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_randint_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_repeat_interleave_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_v_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_transpose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_xlogy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zero__cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_T_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rdiv___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view___rsub___cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_bool_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_double_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_all_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_broadcast_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_float_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_not_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_logical_xor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_movedim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_rot90_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_square_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_triu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vdot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_where_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__unsafe_masked_index_put_accumulate_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_abs_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atleast_1d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_baddbmm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_byte_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cfloat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_chalf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_char_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_contiguous_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_cumulative_trapezoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_equal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_expand_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_float_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_float_power_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_gradient_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_vander_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_xor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_mT_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_matmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nanmean_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resolve_neg_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_add_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_short_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sigmoid_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_svd_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_take_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_triangular_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unflatten_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_as_real_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___rmul___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_int_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_clone_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_flatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_hstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_index_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_diagonal_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_and_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_or_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_mul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_narrow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_new_ones_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sgn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_squeeze_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_triu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_true_divide_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_byte_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_equal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_geqrf_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_hstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_index_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isnan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_4inputs_with_extra_args_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_det_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_matrix_rank_hermitian_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mT_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_cumsum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_fill_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_logsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_normalize_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_select_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_channel_shuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_triplet_margin_with_distance_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_repeat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_resolve_neg_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_roll_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_unflatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_uniform_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view__chunk_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_double_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_long_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addcdiv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_addcmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_arange_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clamp_min_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_floor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fmod_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_fill_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_item_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_le_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_minimum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu6_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_permute_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_renorm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_round_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_vsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_acos_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atan2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bernoulli_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cat_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_embed_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_dist_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_empty_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_irfft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_frexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_isnan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_item_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_2inputs_2outputs_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_kthvalue_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_lgamma_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_and_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_channel_shuffle_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardtanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_interpolate_bicubic_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_local_response_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_softmin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nonzero_static_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_fro_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_permute_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polar_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rand_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randint_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ravel_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_rsub_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_hann_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_signal_windows_nuttall_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sinc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_k1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_scaled_modified_bessel_k1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_xlog1py_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sum_to_size_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_uniform_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsafe_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_view_copy_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_H_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___ror___cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_allclose_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_not_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_or_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isnan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logaddexp_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_argmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_new_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ones_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randn_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_short_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_squeeze_multiple_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_topk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unflatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_baddbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_max_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cholesky_inverse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_cos_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_i0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_softmax_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_minimum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_rrelu_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_resolve_conj_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___rdiv___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cholesky_solve_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_deg2rad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_half_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_log10_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mT_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_max_pool3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_entr_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_square_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_trunc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_fftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isposinf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_inv_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rand_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_randn_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unbind_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsqueeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_acosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addbmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bool_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diff_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_floor_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_gradient_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_isreal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lcm_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logdet_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nan_to_num_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nanmean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_empty_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_margin_loss_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softplus_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ones_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_permute_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_remainder_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scalar_tensor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_y0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_y1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_log_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_w_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_triu_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_vsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_where_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_float16, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_ones_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_bool, test/test_ops.py::TestTagsCUDA::test_tags_T_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___radd___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___rmod___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags___ror___cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags___rpow___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_T_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_byte_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs__conversions_int_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_atleast_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_tensors_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_broadcast_to_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_diagonal_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_flipud_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fmod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isclose_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_logical_or_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_logspace_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nan_to_num_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_narrow_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_full_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal__in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_renorm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_rot90_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_tanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_true_divide_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unflatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unfold_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unsqueeze_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_var_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_abs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_addmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_left_shift_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_bitwise_xor_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_deg2rad_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_erfinv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifftshift_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_irfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flip_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_index_reduce_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cross_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_factor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_qr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_svd_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_vander_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logdet_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_and_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_argmin_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_masked_cumsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_area_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_normalize_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_silu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nonzero_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_inf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ormqr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize_as__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_roll_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_hamming_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signbit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_i1e_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_split_with_sizes_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_triu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_trunc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsafe_chunk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unsqueeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_where_cuda_float32 2024-11-01T17:21:36.8869470Z 2024-11-01T17:21:36.8869729Z Running test_ops 9/9 ... [2024-11-01 17:21:36.646397] 2024-11-01T17:21:36.8870127Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:21:36.8871518Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_ops.py', '-m', 'not serial', '--shard-id=9', '--num-shards=9', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2024-11-01 17:21:36.646978] 2024-11-01T17:28:14.0675538Z 2024-11-01T17:28:14.0682365Z test_ops 6/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_6.9_106da0b147e130fd_.log 2024-11-01T17:28:14.2043754Z Running 3697 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_argsort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_empty_strided_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_kthvalue_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_multinomial_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_ctc_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_softmin_with_dtype_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_randn_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_atleast_2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_empty_permuted_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_hfftn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_ifft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_isfinite_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_item_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_mH_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_masked_fill_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose3d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nonzero_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_ones_like_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_permute_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_pow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_scalar_tensor_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_sum_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes___rand___cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_char_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_broadcast_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_copysign_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_empty_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_floor_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isfinite_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_lt_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_meshgrid_variadic_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_ndtri_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_xlog1py_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_square_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_true_divide_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_as_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__softmax_backward_data_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__unsafe_masked_index_put_accumulate_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_add_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_all_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cauchy_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_cdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_char_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_dist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_eq_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ge_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_index_reduce_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isneginf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_isposinf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_4inputs_with_extra_args_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_ldl_factor_ex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_matrix_rank_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_multi_dot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_slogdet_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vecdot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_median_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_binary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_multinomial_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_narrow_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_binary_cross_entropy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_elu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_feature_alpha_dropout_without_train_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_unpool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_one_hot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pad_replicate_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_relu6_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_triplet_margin_with_distance_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_upsample_nearest_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_nuc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pca_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polygamma_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_positive_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_rand_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_reciprocal_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_scatter_reduce_amin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_searchsorted_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_nuttall_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_bessel_y1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_erfcx_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_hermite_polynomial_h_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_square_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_t_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_tensordot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_to_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_var_mean_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_vdot_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_errors_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_complex_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfft2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_errors_jiterator_binary_cuda, test/test_ops.py::TestCommonCUDA::test_errors_le_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linalg_lstsq_grad_oriented_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_rsub_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_gaussian_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_errors_signal_windows_general_hamming_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_mul_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_zeros_like_layout2_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_xlog1py_cuda, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_clamp_max_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lgamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_det_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_multi_dot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_decimals_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_tril_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_vstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices__unsafe_masked_index_put_accumulate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cartesian_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cholesky_inverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_max_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_clone_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cosh_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_cummin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_dstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_hfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fliplr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cholesky_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logsumexp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_median_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_feature_alpha_dropout_without_train_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_circular_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_upsample_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_positive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_randint_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize_as__cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_y1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_i1_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_modified_bessel_i0_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_list_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_H_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___radd___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values___rdiv___cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values__unsafe_masked_index_put_accumulate_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_addr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_angle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_as_strided_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_asin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_2d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bitwise_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cos_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erfc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_expand_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_max_reduction_with_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_nn_functional_pixel_unshuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_resize__cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_scalar_tensor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sin_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_hermite_polynomial_he_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_consecutive_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___getitem___cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rdiv___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rmod___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rsub___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_abs_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argmin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argwhere_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_as_strided_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_byte_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_contiguous_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagflat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_permuted_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_ihfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flipud_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_flipud_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isfinite_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_item_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_unary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_ldl_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_xor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_unpack_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_matrix_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_max_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_list_of_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_min_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmedian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_feature_alpha_dropout_without_train_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pca_lowrank_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_conj_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scalar_tensor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_short_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_signal_windows_cosine_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_slice_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_airy_ai_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_u_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_chebyshev_polynomial_v_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_hermite_polynomial_he_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_modified_bessel_i1_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_t_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tril_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_argwhere_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_clamp_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_numpy_ref_diagflat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diff_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_conv_transpose2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_general_cosine_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_fft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_isinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_svd_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__unsafe_masked_index_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_expand_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_i0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_igamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_int_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isreal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lcm_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_lgamma_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_svdvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pdist_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_renorm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_angle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_angle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_diff_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_exp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ifft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_rfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hypot_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_index_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cond_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_det_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_gelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rad2deg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_amax_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_special_log_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_true_divide_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_xlogy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_resolve_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_gaussian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_take_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_uniform_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning___rpow___cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_int_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_acos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcdiv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addcmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_bitwise_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_clone_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_contiguous_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_cumsum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_empty_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_erf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_frac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isinf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_isreal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_and_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_meshgrid_variadic_tensors_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_margin_ranking_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pixel_shuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_tanhshrink_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_triplet_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_permute_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_positive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_reshape_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sign_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_bessel_j1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_split_with_sizes_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_multiple_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tan_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_transpose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_trunc_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unsqueeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_acos_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_asinh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_block_diag_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_cholesky_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_exp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eye_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_fft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_hfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_i0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isclose_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_2inputs_2outputs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvals_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_eigvalsh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_logit_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_long_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_lu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_matmul_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_msort_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mv_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_ne_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_adaptive_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_avg_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_conv2d_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_cosine_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_logsigmoid_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_pool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pad_replicate_negative_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_smooth_l1_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_unfold_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nonzero_static_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_real_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_resolve_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_decimals_neg_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_short_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_blackman_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_sin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_split_list_args_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_std_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_to_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_trapz_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_split_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_vdot_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_cos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_exp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_3_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_4_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_t_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_chebyshev_polynomial_w_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_xlog1py_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bool_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acos_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_2d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_max_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clone_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumsum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_floor_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flipud_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmax_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_vector_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log10_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logaddexp_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logsumexp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nextafter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_gelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardtanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_leaky_relu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_nll_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_prelu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal__in_place_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_prod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rad2deg_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_round_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_signbit_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i0e_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_take_along_dim_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_to_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trunc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_zeros_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diag_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dot_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_hfftn_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_fft_rfft2_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_igamma_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_prelu_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_softshrink_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_special_zeta_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_tril_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_vsplit_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_T_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bfloat16_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_byte_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_chalf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_half_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_int_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_long_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_short_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_allclose_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amax_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asinh_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_1d_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_3d_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_left_shift_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_to_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bucketize_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_chunk_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clone_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_constant_pad_nd_executor_aten_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_copysign_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumprod_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cumsum_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_deg2rad_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_floor_rounding_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dot_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_like_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eq_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_as_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exponential_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfftn_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfftn_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frac_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_frexp_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ge_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_i0_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_igamma_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isreal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_complex32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_diagonal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_matrix_norm_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vector_norm_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log2_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_list_of_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_meshgrid_variadic_tensors_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_native_layer_norm_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_ones_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_dropout_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_elu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_gelu_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_glu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_huber_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_mish_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_unshuffle_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_poisson_nll_loss_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_selu_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmax_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softplus_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_pow_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reciprocal_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_repeat_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rot90_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sgn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_entr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_1_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_std_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vsplit_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_byte_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cfloat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_short_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addcdiv_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_allclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_asinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_or_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_right_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_min_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clone_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumsum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_no_rounding_mode_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_strided_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eye_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_float_power_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_geometric_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_heaviside_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hypot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_le_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_vector_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logaddexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_masked_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_list_of_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ne_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_strided_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_celu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_channel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_glu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_tanhshrink_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_threshold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_number_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_signbit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i0e_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1e_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_zeta_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_split_with_sizes_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vdot_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_T_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_char_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_any_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asinh_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_2d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_left_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_right_shift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_min_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_copysign_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_digamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftshift_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_rfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flip_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flipud_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmax_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmod_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gcd_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isclose_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isnan_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isneginf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_le_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log1p_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_normal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_meshgrid_variadic_tensors_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_mul_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_celu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_group_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardshrink_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hinge_embedding_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_nll_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_unshuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ones_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_roll_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sigmoid_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_logit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_to_size_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tan_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_complex_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_H_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rdiv___cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__chunk_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager__softmax_backward_data_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_decomposed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_argwhere_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cartesian_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_clone_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_constant_pad_nd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cummax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dist_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_like_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_empty_permuted_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_eq_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expand_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftshift_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flip_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gradient_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_heaviside_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_imag_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_select_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eig_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_svd_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log10_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logcumsumexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logcumsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_xor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_long_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mT_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_var_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nansum_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_native_dropout_backward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_empty_strided_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_feature_alpha_dropout_without_train_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_group_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pca_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rand_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reciprocal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_resolve_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_short_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_j1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_i1e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_split_with_sizes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_std_unbiased_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_t_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_tan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_unbiased_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_min_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_alpha_dropout_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_embedding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_glu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_norm_fro_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_permute_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_reshape_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sinc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_mm_reduce_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sparse_sampled_addmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_take_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trapezoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rsub___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_conj_physical_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fmod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_reduce_mean_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_isfinite_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_gaussian_nll_loss_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_rrelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_softsign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_permute_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randint_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resolve_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_add_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_squeeze_multiple_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_transpose_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_vstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__unsafe_masked_index_put_accumulate_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcdiv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diag_embed_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_empty_permuted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isinf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eig_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_unpack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_var_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_new_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_hardswish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_linear_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mish_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ormqr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rad2deg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scalar_tensor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_sparse_mm_reduce_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_triangular_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_tril_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unflatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_aminmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_as_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_hfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_ihfft_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_heaviside_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isneginf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_select_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nan_to_num_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_empty_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_feature_alpha_dropout_without_train_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pairwise_distance_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_triplet_margin_with_distance_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_ones_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_qr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_searchsorted_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_airy_ai_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_ndtri_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_squeeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_svd_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_bool_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cummin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_deg2rad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_hfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ihfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_flipud_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_igamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_kron_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ldexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_cholesky_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_det_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_solve_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_svd_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_log1p_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logcumsumexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_lu_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_cumprod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_narrow_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_empty_strided_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_new_zeros_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_channel_shuffle_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_rms_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_pinverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_put_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_select_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_exponential_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_slice_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_hermite_polynomial_he_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_modified_bessel_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_xlog1py_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_vdot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_view_as_complex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_zeros_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view_H_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_cdouble_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_short_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_addcmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_eq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_expand_copy_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_expand_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_lerp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linalg_matrix_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_masked_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_empty_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_squeeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_std_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_t_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_trace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_transpose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_acos_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addcdiv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addcmul_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_alias_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_argwhere_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_partial_views_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_asinh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_atleast_2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diff_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_double_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_expand_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_irfft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_flipud_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_select_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_int_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isinf_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_isnan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_jiterator_binary_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cross_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_factor_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lu_solve_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_norm_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_solve_triangular_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_lu_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_logsumexp_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_mean_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_conv2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_linear_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pad_reflect_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_nuc_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_outer_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_roll_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_slice_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_split_list_args_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_squeeze_multiple_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_t_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_take_along_dim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tanh_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_to_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_zero__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_T_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_char_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_short_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_allclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_cos_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_diagonal_copy_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_dsplit_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_eye_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fliplr_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isfinite_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_isnan_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_nn_functional_channel_shuffle_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_positive_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reciprocal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_sum_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_transpose_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unflatten_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_unsqueeze_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addcmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_argwhere_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_asin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_baddbmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cdouble_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cumulative_trapezoid_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_dstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_empty_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_fftshift_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_ifft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_fft_irfftn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_isclose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_item_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_jiterator_binary_return_by_ref_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_diagonal_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_eigh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_inv_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_svdvals_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_zeros_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv3d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_conv_transpose1d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_replicate_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_softsign_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_pinverse_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ravel_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_split_with_sizes_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_triangular_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_vdot_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_view_as_real_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_zero__cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_view___getitem___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_half_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs__conversions_int_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_any_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_asin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_asinh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_bucketize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_column_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_eq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_rfft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_heaviside_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_hsplit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_linalg_vecdot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_normal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logaddexp_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logsumexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_movedim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_narrow_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_mish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_relu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softplus_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal__in_place_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_normal_number_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_reshape_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_i1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_zeta_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_square_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_sum_to_size_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_t_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tanh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tensor_split_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_transpose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_true_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unsqueeze_copy_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__unsafe_masked_index_put_accumulate_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_addcmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_argsort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_as_strided_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_bucketize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_copysign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cumulative_trapezoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_erfinv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exp2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_exp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expm1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_fft2_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ihfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_rfftn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flipud_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_float_power_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_floor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_geqrf_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_grid_sampler_2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_histc_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_binary_return_by_ref_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_le_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigh_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_inv_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_ldl_factor_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_multi_dot_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_pinv_hermitian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_ex_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log1p_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_lu_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mT_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_cumsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_log_softmax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_normalize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_matmul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_native_dropout_backward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_adaptive_avg_pool3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_batch_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_embedding_bag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_glu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_linear_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_pool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool1d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_max_unpool2d_grad_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multi_head_attention_forward_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_normalize_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_reflect_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_pad_replicate_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pinverse_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_pow_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_put_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_randn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resize_as__cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_conj_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rot90_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scatter_reduce_mean_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_searchsorted_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_bessel_j1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_chebyshev_polynomial_u_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_entr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_i1e_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_stack_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_svd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_true_divide_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unfold_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_xlogy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_zeros_like_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake___rmod___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake___rsub___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__chunk_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_all_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atan2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmatmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_partial_views_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bincount_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_bitwise_left_shift_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_byte_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_clamp_min_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_conj_physical_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_count_nonzero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cummin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_empty_permuted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_equal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_irfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_full_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_gt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_heaviside_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_inner_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_int_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_kron_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lerp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_multi_dot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_svdvals_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_not_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_logsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_movedim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_dropout_backward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_native_layer_norm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nonzero_static_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_ormqr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_permute_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_positive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_randint_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resolve_neg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtri_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_tril_indices_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_unique_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bincount_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___radd___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp___rpow___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_atleast_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fliplr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_cumsum_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_narrow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_upsample_bilinear_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i0e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsafe_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_unsqueeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp___radd___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_dsplit_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_inv_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_msort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nanquantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_celu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_dropout3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_gaussian_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_linear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_stft_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_to_sparse_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_rfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flatten_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_item_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ldexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_householder_product_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vector_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanmean_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_native_layer_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nextafter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_fractional_max_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_permute_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_renorm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_roll_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_rot90_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_round_decimals_0_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_kaiser_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_nuttall_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_t_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_torch_ops_aten__flash_attention_forward_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_fake_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_trapz_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_triangular_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_view_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_asin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bernoulli_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bitwise_and_cuda_int64, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cdist_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_clone_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_cumsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_div_trunc_rounding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ihfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_full_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_grid_sampler_2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_kron_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_le_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logcumsumexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lu_unpack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_argmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_fill_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_min_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_native_dropout_backward_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_new_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_elu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_relu6_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_silu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_upsample_nearest_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_normal_number_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_outer_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_permute_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_rad2deg_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_searchsorted_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_bessel_j1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sum_to_size_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_trace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_transpose_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_var_mean_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_zero__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_bfloat16, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_int32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_zeros_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_int8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_int32, test/test_ops.py::TestTagsCUDA::test_tags__batch_norm_with_update_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_bitwise_or_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags__refs_clamp_max_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_contiguous_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_expand_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hsplit_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_hypot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_igamma_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isneginf_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_lcm_cuda_int64, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_log1p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_masked_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_pixel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sgn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_transpose_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_view_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_xlogy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__softmax_backward_data_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argwhere_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_atanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_bernoulli_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cauchy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ceil_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cfloat_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_clamp_min_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cosh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_diff_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_div_floor_rounding_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_ifftn_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_fft_rfft_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_flatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_float_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gather_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_grid_sampler_2d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_i0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_imag_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_index_add_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_householder_product_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_inv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lstsq_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_multi_dot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_vector_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_or_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lu_unpack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_cumprod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_log_softmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_logsumexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_select_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_std_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_matrix_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_min_reduction_no_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_movedim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_multinomial_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nan_to_num_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nanmean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_batch_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_channel_shuffle_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose1d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_linear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_mish_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_one_hot_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_softplus_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_in_place_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_normal_number_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_positive_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_pow_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_quantile_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_randint_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_decimals_3_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_blackman_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_signal_windows_hann_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_bessel_y1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_legendre_polynomial_p_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_log_ndtr_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_ndtri_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_along_dim_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_unique_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_vstack_cuda_float32 2024-11-01T17:28:14.3184662Z 2024-11-01T17:28:14.3185014Z Running test_binary_ufuncs 1/1 ... [2024-11-01 17:28:14.075908] 2024-11-01T17:28:14.3185467Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:28:14.3186768Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_binary_ufuncs.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... 
[2024-11-01 17:28:14.076406] 2024-11-01T17:29:04.9709270Z 2024-11-01T17:29:04.9715293Z test_ops 9/9 was successful, full logs can be found in artifacts with path test/test-reports/test_ops_9.9_d3b5accf1732e53f_.log 2024-11-01T17:29:05.1208259Z Running 3757 items in this shard: test/test_ops.py::TestCommonCUDA::test_compare_cpu___rdiv___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu___rsub___cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_arange_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_as_strided_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_fft_fftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_nn_functional_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_repeat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_split_with_sizes_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cdist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_column_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cummin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_exponential_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_gradient_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_igammac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_index_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_multi_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_slogdet_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_logspace_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_native_batch_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_inf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_resize_as__cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_special_zeta_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_std_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_to_sparse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_trace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_triangular_solve_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_compare_cpu_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_compare_cpu_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_as_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_cdouble_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_char_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_mT_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_empty_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_new_empty_strided_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_nn_functional_conv_transpose2d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_rsqrt_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_squeeze_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_trace_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_complex_half_reference_testing_view_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_byte_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs__conversions_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_bitwise_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_clamp_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_constant_pad_nd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_diagonal_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_eq_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erfc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_erfinv_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_fmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_hypot_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_imag_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_index_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_le_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_linalg_svdvals_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log10_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_log2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_logspace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nan_to_num_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_neg_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_new_zeros_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardshrink_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_hardtanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_normal__in_place_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sinh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_special_multigammaln_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_std_mean_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes__refs_sub_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tanh_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_trace_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes__refs_triu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_addmm_decomposed_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_atan_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_byte_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cartesian_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_count_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_cumulative_trapezoid_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diag_embed_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_diagflat_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_double_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_einsum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_empty_permuted_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_hfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fft_irfft2_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_fill_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_full_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_histc_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_hsplit_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_jiterator_unary_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_le_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_det_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_pinv_singular_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_linalg_vector_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_log_softmax_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logical_xor_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_logsumexp_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mT_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_argmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_softmin_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_masked_sum_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_matmul_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_min_reduction_with_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_mm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_native_dropout_backward_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ne_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_avg_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv1d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_conv3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_fractional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_huber_loss_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_instance_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_interpolate_area_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_leaky_relu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_multi_head_attention_forward_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_rrelu_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nn_functional_threshold_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_nonzero_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_norm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_normal_in_place_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ones_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_ones_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_outer_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_pinverse_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_polar_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_prod_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randint_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_randn_like_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_remainder_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_select_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_sparse_sampled_addmm_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_airy_ai_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_chebyshev_polynomial_w_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_i1e_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_ndtr_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_special_scaled_modified_bessel_k0_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_std_mean_unbiased_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_svd_lowrank_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_take_along_dim_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_tensor_split_cuda, 
test/test_ops.py::TestCommonCUDA::test_dtypes_torch_ops_aten__safe_softmax_default_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_as_complex_cuda, test/test_ops.py::TestCommonCUDA::test_dtypes_view_copy_cuda, test/test_ops.py::TestCommonCUDA::test_errors___rmul___cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_or_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bitwise_right_shift_cuda, test/test_ops.py::TestCommonCUDA::test_errors_bucketize_cuda, test/test_ops.py::TestCommonCUDA::test_errors_diagonal_cuda, test/test_ops.py::TestCommonCUDA::test_errors_div_floor_rounding_cuda, test/test_ops.py::TestCommonCUDA::test_errors_exponential_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_fftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_ihfftn_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fft_irfft_cuda, test/test_ops.py::TestCommonCUDA::test_errors_fliplr_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gcd_cuda, test/test_ops.py::TestCommonCUDA::test_errors_gradient_cuda, test/test_ops.py::TestCommonCUDA::test_errors_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_errors_jiterator_binary_return_by_ref_cuda, test/test_ops.py::TestCommonCUDA::test_errors_linspace_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logical_or_cuda, test/test_ops.py::TestCommonCUDA::test_errors_logspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_errors_maximum_cuda, test/test_ops.py::TestCommonCUDA::test_errors_minimum_cuda, test/test_ops.py::TestCommonCUDA::test_errors_mul_cuda, test/test_ops.py::TestCommonCUDA::test_errors_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_neg_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_adaptive_max_pool3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv1d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv2d_cuda, 
test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_conv3d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_max_pool2d_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_rms_norm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_nn_functional_soft_margin_loss_cuda, test/test_ops.py::TestCommonCUDA::test_errors_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_errors_scatter_add_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_randn_like_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_sparse_sum_layout3_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_hermite_polynomial_he_cuda, test/test_ops.py::TestCommonCUDA::test_errors_special_legendre_polynomial_p_cuda, test/test_ops.py::TestCommonCUDA::test_errors_triu_cuda, test/test_ops.py::TestCommonCUDA::test_errors_xlogy_cuda, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_argmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_asin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_eye_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_frexp_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_kron_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_le_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_inv_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_linalg_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_masked_select_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_mv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_ormqr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_quantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_decimals_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_special_polygamma_special_polygamma_n_0_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_squeeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_vdot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_meta_consistency_out_dtype_mismatch_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___getitem___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___radd___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices___rmatmul___cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices___ror___cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices__segment_reduce_offsets_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acos_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_addmv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_alias_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_aminmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_as_strided_partial_views_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_atan2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bfloat16_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_block_diag_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_embed_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diag_embed_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_diff_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_einsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_exp_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ifftshift_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_irfft_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_mean_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_index_reduce_mean_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_int_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_jiterator_unary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_eigh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_lstsq_grad_oriented_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_tensorinv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linalg_vander_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_lu_unpack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mT_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_amin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_argmax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_cumprod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_log_softmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_masked_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_1_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nan_to_num_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_narrow_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_cosine_embedding_loss_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_kl_div_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_mish_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_pad_replicate_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_relu_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_nonzero_static_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_outer_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_2_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reciprocal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_reshape_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resize__cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_resolve_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_rot90_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sinc_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_slice_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_t_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_special_chebyshev_polynomial_v_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_sum_to_size_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_t_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_multiple_devices_tile_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_torch_ops_aten__safe_softmax_default_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unbind_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_unsqueeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_multiple_devices_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_multiple_devices_xlogy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values__chunk_cat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_argwhere_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_atleast_3d_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_copysign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_cummax_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_erf_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_ihfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fft_rfft_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_fill_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flip_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_flipud_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_float_power_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_logical_and_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_new_ones_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_short_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_sign_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_i0e_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_modified_bessel_k0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_special_zeta_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_square_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_t_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_tan_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unique_cuda_bool, test/test_ops.py::TestCommonCUDA::test_non_standard_bool_values_unsafe_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_H_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rand___cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples___rpow___cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__chunk_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples__upsample_bilinear2d_aa_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_arange_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_argsort_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_asin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atan_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_baddbmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_broadcast_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cartesian_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_chunk_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_combinations_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_conj_physical_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_count_nonzero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cummax_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumprod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_dist_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_div_trunc_rounding_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_empty_permuted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_equal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_exp2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_fftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_fft_irfftn_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_hsplit_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_index_select_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_inner_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_isneginf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_2inputs_2outputs_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_4inputs_with_extra_args_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_jiterator_binary_return_by_ref_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ldexp_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cholesky_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_cross_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_det_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_inv_ex_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_slogdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linalg_svdvals_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logdet_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_and_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_lu_solve_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mH_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_cumsum_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_prod_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_std_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_masked_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_meshgrid_variadic_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_msort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_mul_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nanquantile_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_narrow_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ne_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_new_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_adaptive_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_avg_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_bilinear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_binary_cross_entropy_with_logits_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_l1_loss_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_layer_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_pixel_unshuffle_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_silu_complex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_softsign_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_threshold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ones_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_0_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_polygamma_polygamma_n_1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_qr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randint_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_randn_like_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_ravel_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_reciprocal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_repeat_interleave_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resize_as__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_resolve_neg_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_roll_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_round_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_rsub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_scatter_reduce_prod_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_entr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_ndtr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_scaled_modified_bessel_k0_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_split_with_sizes_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_std_unbiased_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_sub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_take_along_dim_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_trapz_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unflatten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_uniform_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsafe_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_unsqueeze_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_view_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_vstack_cuda_int64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zero__cuda_complex64, test/test_ops.py::TestCommonCUDA::test_noncontiguous_samples_zeros_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_numpy_ref_broadcast_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_cat_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_jiterator_4inputs_with_extra_args_cuda_int64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vander_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_linalg_vecdot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_numpy_ref_meshgrid_variadic_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_bartlett_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_signal_windows_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_squeeze_multiple_cuda_float64, test/test_ops.py::TestCommonCUDA::test_numpy_ref_tensor_split_cuda_float64, test/test_ops.py::TestCommonCUDA::test_out___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_cdouble_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_addcdiv_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_amin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out__refs_clone_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_contiguous_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_expm1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_flip_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_float_power_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_lerp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_logaddexp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ne_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_neg_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_elu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_glu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out__refs_special_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unbind_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out__refs_zeros_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_as_strided_partial_views_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_out_cholesky_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_chunk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_clamp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_corrcoef_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_dstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_fft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_fliplr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_float_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_floor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_hsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_isclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_ldexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_eigvals_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_inv_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_linalg_ldl_factor_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_lu_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_masked_softmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nextafter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_interpolate_linear_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_pairwise_distance_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_nn_functional_threshold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_norm_nuc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_pow_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_rand_like_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error__batch_norm_with_update_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_acos_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_add_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addcmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addmv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_addr_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_column_stack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_cross_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_hfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_ihfft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fft_irfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_fmod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_gather_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_eigvalsh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_lu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_linalg_solve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log1p_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_log_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_lu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_matmul_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mode_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_mv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nansum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_norm_inf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_outer_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_scatter_add_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sort_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_squeeze_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_sub_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_topk_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_requires_grad_error_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_searchsorted_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_i0e_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_special_ndtri_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_stft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_svd_lowrank_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_trapezoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_tril_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unfold_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_unsafe_split_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_out_warning_H_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__batch_norm_with_update_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs__conversions_short_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_all_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_any_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_as_strided_scatter_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_conj_physical_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_digamma_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_eye_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_fftshift_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_ihfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_fft_irfftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_istft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_linalg_cross_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_log10_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_masked_fill_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_native_layer_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_new_full_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_group_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_pixel_unshuffle_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_nn_functional_softmin_with_dtype_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_ones_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_renorm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_repeat_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_rsqrt_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sgn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_sin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_multigammaln_mvlgamma_p_3_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_special_spherical_bessel_j0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_std_mean_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_tril_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_triu_indices_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_unflatten_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__refs_where_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning__unsafe_masked_index_put_accumulate_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_abs_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_add_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_addr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_alias_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_amin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_asin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_atan2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_bmm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_broadcast_tensors_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_conj_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cosh_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_cumprod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_double_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_eq_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ifftn_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_fft_ihfft_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_fft_rfft_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_frexp_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_full_like_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_half_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_igammac_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_index_put_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_isin_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_jiterator_unary_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_cholesky_ex_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_det_singular_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_ldl_solve_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_matrix_power_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_norm_subgradients_at_zero_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_pinv_hermitian_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_qr_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linalg_svd_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_linspace_tensor_overload_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log2_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_normal_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_log_softmax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_median_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_prod_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_masked_var_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_max_pool2d_with_indices_backward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_mean_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_nansum_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_narrow_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_batch_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_celu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_glu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_instance_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bicubic_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_interpolate_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool1d_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_max_unpool2d_grad_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_one_hot_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pairwise_distance_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_pdist_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_poisson_nll_loss_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_nn_functional_upsample_bilinear_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_norm_inf_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_outer_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_polygamma_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_positive_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_rad2deg_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_repeat_interleave_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_round_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_amax_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_scatter_reduce_mean_cuda, 
test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_general_cosine_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_signal_windows_kaiser_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_bessel_y0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_polygamma_special_polygamma_n_0_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_special_scaled_modified_bessel_k1_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_squeeze_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_t_copy_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_torch_ops_aten__flash_attention_forward_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_triu_cuda, test/test_ops.py::TestCommonCUDA::test_out_warning_unsafe_chunk_cuda, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_asinh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_atan_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_copysign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_div_no_rounding_mode_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_erfc_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_expm1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_lgamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log1p_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_log2_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_mean_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_std_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_masked_var_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_polygamma_polygamma_n_2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rad2deg_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_rsqrt_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int32, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sinh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_hermite_polynomial_he_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_laguerre_polynomial_l_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_special_zeta_cuda_int64, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_promotes_int_to_float_xlogy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_T_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_bfloat16_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_byte_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_char_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_double_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_float_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_half_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_int_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_long_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs__conversions_short_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_acosh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addcmul_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_addr_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_all_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_amin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_any_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_arange_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_partial_views_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_as_strided_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_asinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atan2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_1d_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_atleast_3d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_and_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_bitwise_right_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_block_diag_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cauchy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_clamp_min_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_conj_physical_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_constant_pad_nd_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_contiguous_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_count_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_cumprod_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_deg2rad_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_dsplit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_empty_strided_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_equal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_erfinv_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_as_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expand_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_eye_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_fftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_hfft2_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ifftshift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft2_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfft_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_ihfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_irfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fft_rfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_flatten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fliplr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_float_power_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_floor_divide_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gcd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ge_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_gt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_heaviside_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_hstack_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_imag_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_add_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_index_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isclose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isfinite_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isinf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isnan_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isposinf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_isreal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_item_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_le_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lgamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_cross_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linalg_svd_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_linspace_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log1p_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_and_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_not_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logical_xor_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_logspace_tensor_overload_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_lt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_masked_fill_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_maximum_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_mul_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_narrow_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ne_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_full_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_new_zeros_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_channel_shuffle_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_elu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_group_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_hardshrink_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_log_softmax_with_dtype_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu6_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_relu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_smooth_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softmin_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_softshrink_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_tanhshrink_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_threshold_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_nn_functional_triplet_margin_loss_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_positive_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_ravel_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_real_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reciprocal_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_remainder_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_renorm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_reshape_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_roll_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rot90_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_rsub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_select_scatter_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sgn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sigmoid_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sign_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sinh_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_bessel_j1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_entr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_i1e_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_logit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtr_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_ndtri_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_spherical_bessel_j0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_split_with_sizes_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sqrt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_square_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_squeeze_multiple_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_t_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tan_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tanh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tensor_split_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_trace_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_transpose_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_indices_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_tril_indices_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_triu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_true_divide_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unbind_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unfold_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_unsqueeze_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_var_mean_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vdot_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_as_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_view_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_where_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref__refs_xlogy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_dstack_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_gt_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_heaviside_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_item_cuda, 
test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_logaddexp_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nextafter_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_nn_functional_hinge_embedding_loss_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_errors__refs_pow_cuda, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_bool_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cdouble_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_cfloat_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_char_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_double_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs__conversions_float_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_abs_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acos_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_acosh_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcdiv_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addcmul_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_addr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_alias_copy_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_amin_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_any_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_arange_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_partial_views_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_as_strided_scatter_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_asin_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atanh_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_atleast_2d_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_not_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_or_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_right_shift_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_bitwise_xor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_block_diag_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_broadcast_tensors_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cat_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ceil_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_max_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_clamp_min_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_column_stack_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_conj_physical_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_contiguous_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cos_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_cosh_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_count_nonzero_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_embed_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diag_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_diagonal_scatter_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_digamma_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_no_rounding_mode_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_div_trunc_rounding_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dsplit_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_dstack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_empty_strided_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_equal_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erf_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_erfinv_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_exp2_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_expand_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_eye_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fft2_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftn_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_fftshift_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_hfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifft_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftn_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ifftshift_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_ihfft2_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft2_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_irfft_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfft2_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fft_rfftn_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flatten_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flip_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fliplr_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_flipud_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_float_power_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_divide_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_floor_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmax_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmin_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_fmod_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_geometric_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_gt_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_heaviside_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hsplit_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_hstack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_add_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_copy_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_fill_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_index_select_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isclose_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isinf_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isnan_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isneginf_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_isposinf_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_item_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lerp_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_lgamma_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_cross_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_norm_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_svdvals_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linalg_vecdot_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_linspace_tensor_overload_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log10_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log1p_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logaddexp2_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_and_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_not_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_or_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logical_xor_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logspace_tensor_overload_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_logsumexp_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_masked_fill_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_maximum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_minimum_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_movedim_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_mul_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nan_to_num_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_narrow_copy_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ne_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_neg_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_empty_strided_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_full_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_new_zeros_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nextafter_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_channel_shuffle_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_hardtanh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_l1_loss_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_layer_norm_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_log_softmax_with_dtype_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_margin_ranking_loss_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pairwise_distance_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_pixel_shuffle_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu6_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_relu_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_smooth_l1_loss_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softmin_with_dtype_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_softshrink_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_tanhshrink_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_threshold_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_nn_functional_triplet_margin_loss_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_norm_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal__in_place_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_normal_number_mean_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ones_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_permute_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_positive_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_prod_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rad2deg_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_ravel_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_real_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_remainder_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_as_executor_aten_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_reshape_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_round_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsqrt_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_rsub_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_select_scatter_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sigmoid_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sign_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_signbit_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sin_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinc_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sinh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_softmax_with_dtype_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_bessel_j0_executor_aten_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_erfcx_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i0e_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_i1e_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_ndtr_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_log_softmax_with_dtype_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_logit_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_3_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_multigammaln_mvlgamma_p_5_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtr_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_ndtri_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_softmax_with_dtype_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_spherical_bessel_j0_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_xlog1py_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_special_zeta_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_split_with_sizes_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sqrt_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_square_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_squeeze_multiple_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_stack_executor_aten_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sub_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_executor_aten_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_sum_to_size_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_t_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_take_along_dim_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tan_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tanh_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tensor_split_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_to_executor_aten_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trace_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_copy_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_transpose_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_tril_indices_executor_aten_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_triu_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_true_divide_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_trunc_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unbind_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unflatten_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_copy_executor_aten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unfold_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_copy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_unsqueeze_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_var_mean_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vdot_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_complex_executor_aten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_as_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_view_copy_executor_aten_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_vstack_executor_aten_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_where_executor_aten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_xlogy_executor_aten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_executor__refs_zeros_executor_aten_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_T_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_bfloat16_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_cdouble_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_chalf_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_double_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_half_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_int_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_long_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs__conversions_polar_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acos_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_acosh_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_add_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_addr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_all_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_amin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_any_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_as_strided_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atanh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_1d_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_2d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_atleast_3d_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_and_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bitwise_left_shift_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_block_diag_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_shapes_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_tensors_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_bucketize_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cauchy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ceil_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_chunk_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_clamp_max_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_column_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_conj_physical_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_copysign_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cos_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_count_nonzero_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_deg2rad_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diag_embed_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_digamma_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_floor_rounding_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_div_trunc_rounding_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_dstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_empty_like_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_eq_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_equal_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfc_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exp2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expand_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_expm1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_exponential_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfft_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ifftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfft_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_ihfftn_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfft_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_irfftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfft2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fft_rfftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fill_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flip_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fliplr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_flipud_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_floor_divide_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmax_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmin_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_fmod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_frac_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ge_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_gt_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hsplit_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_hstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_i0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_add_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_index_select_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isclose_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isfinite_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isneginf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isposinf_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_isreal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_istft_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lerp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_lgamma_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_cross_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_diagonal_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_matrix_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linalg_svd_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log10_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log2_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_log_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_or_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logical_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logspace_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_logsumexp_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_meshgrid_variadic_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_minimum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_mul_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nan_to_num_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_narrow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_native_layer_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_neg_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_empty_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_full_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_new_zeros_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_dropout_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_huber_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_layer_norm_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_margin_ranking_loss_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_mish_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pairwise_distance_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_pixel_shuffle_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu6_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_selu_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_smooth_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmax_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softmin_with_dtype_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_softplus_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_nn_functional_triplet_margin_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_norm_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_normal_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ones_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_permute_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_positive_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_pow_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_randn_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_ravel_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_remainder_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_reshape_as_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_roll_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_round_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsqrt_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_rsub_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_select_scatter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sgn_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sigmoid_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sign_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sin_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sinh_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_erfcx_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_i1_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_ndtr_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_log_softmax_with_dtype_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_logit_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_ndtri_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_spherical_bessel_j0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_special_xlog1py_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sqrt_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_square_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_squeeze_multiple_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_std_mean_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_stft_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sub_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_sum_to_size_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_t_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_take_along_dim_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tan_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tensor_split_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_to_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trace_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_transpose_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_tril_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_true_divide_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_trunc_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unbind_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unflatten_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_copy_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unfold_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_unsqueeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_var_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_as_complex_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_copy_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_view_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vsplit_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_vstack_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_where_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_meta__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bfloat16_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_bool_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_bfloat16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_byte_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cdouble_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_cfloat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_double_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_float_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_half_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_int_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_long_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs__conversions_short_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_bool, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_abs_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_acosh_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_add_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcdiv_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addcmul_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_addr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_alias_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_all_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_allclose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amax_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_amin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_arange_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_copy_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_as_strided_cuda_float16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_asin_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atan2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_1d_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_atleast_3d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_and_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_not_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_or_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bitwise_xor_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_block_diag_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_tensors_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_broadcast_to_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_bucketize_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cat_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ceil_cuda_int8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_chunk_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clamp_max_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_clone_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_column_stack_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_constant_pad_nd_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_contiguous_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cos_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumprod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_cumsum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_deg2rad_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diag_embed_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_diagonal_scatter_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_no_rounding_mode_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_div_trunc_rounding_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dsplit_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_dstack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_like_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_empty_strided_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_eq_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_equal_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erf_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_erfinv_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_exp2_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expand_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_expm1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_fftshift_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_hfft2_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifft_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ifftn_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_ihfft2_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft2_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfft_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fft_irfftn_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fill_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_flatten_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fliplr_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_float_power_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_floor_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_fmin_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frac_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_frexp_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ge_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_geometric_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_gt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hsplit_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hstack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_hypot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_i0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_igammac_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_imag_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_copy_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_fill_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_index_select_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int16, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isfinite_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isinf_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isposinf_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_isreal_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_item_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lerp_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_lgamma_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_cross_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_diagonal_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_matrix_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_norm_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vecdot_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_complex128, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log10_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log2_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logaddexp2_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_not_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_or_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logical_xor_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_cuda_int64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_logspace_tensor_overload_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_masked_fill_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_maximum_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_movedim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nan_to_num_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_narrow_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ne_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_empty_strided_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_full_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nextafter_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_alpha_dropout_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_channel_shuffle_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_gelu_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_hardtanh_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_l1_loss_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_leaky_relu_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_log_softmax_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_margin_ranking_loss_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pairwise_distance_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_pixel_shuffle_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_poisson_nll_loss_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu6_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_relu_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_selu_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softmin_with_dtype_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softplus_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_nn_functional_softshrink_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal__in_place_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_copy_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_permute_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_positive_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_pow_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_prod_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rad2deg_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_randn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_ravel_cuda_int32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_real_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reciprocal_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_renorm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_repeat_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_as_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_reshape_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rot90_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_round_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsqrt_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_rsub_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sgn_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sign_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_signbit_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sin_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinc_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_softmax_with_dtype_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_bessel_j1_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_entr_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_erfcx_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i0e_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_i1e_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_ndtr_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_log_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtr_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_ndtri_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_softmax_with_dtype_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_special_spherical_bessel_j0_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_squeeze_multiple_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_stack_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_std_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_float64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sub_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_sum_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_t_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_take_along_dim_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_bfloat16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tanh_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tensor_split_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_to_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trace_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_transpose_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_complex64, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_tril_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_triu_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_bool, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_true_divide_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_trunc_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unflatten_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_float16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unfold_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_unsqueeze_copy_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_as_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_float64, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_copy_cuda_int32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_view_cuda_complex128, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_vstack_cuda_uint8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_where_cuda_int8, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_complex32, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int16, test/test_ops.py::TestCommonCUDA::test_python_ref_torch_fallback__refs_zeros_cuda_int64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_T_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___radd___cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager___rpow___cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_acosh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_addmm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_allclose_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_as_strided_partial_views_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_asinh_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_atleast_3d_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_bernoulli_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_byte_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cat_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_char_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cholesky_inverse_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_physical_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_conj_physical_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_copysign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_cov_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_copy_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_diagonal_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_digamma_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_floor_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_div_trunc_rounding_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_dot_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_fftn_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_hfftn_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_fft_irfftn_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_flatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_gather_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_index_put_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isinf_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_isnan_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_kron_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_det_singular_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_eig_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_householder_product_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_matrix_rank_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_multi_dot_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_pinv_hermitian_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_ex_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_linalg_tensorsolve_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_log_normal_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logical_and_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_logsumexp_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mH_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_masked_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_maximum_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mean_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_mm_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_movedim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nanmean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_neg_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_ones_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_new_zeros_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv1d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_conv2d_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_interpolate_nearest-exact_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_pool3d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multi_head_attention_forward_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nn_functional_selu_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_nonzero_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_norm_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_normal_in_place_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_ormqr_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_pinverse_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_prod_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_real_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_reshape_as_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_roll_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_round_decimals_neg_3_cuda_float32, 
test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_rsqrt_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_scalar_tensor_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_select_scatter_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sign_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signal_windows_hann_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_signbit_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_sinh_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y0_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_special_bessel_y1_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_square_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_stft_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_take_along_dim_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_triu_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unflatten_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unique_consecutive_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_unsqueeze_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_var_mean_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_as_real_cuda_complex64, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_copy_cuda_float32, test/test_ops.py::TestCommonCUDA::test_variant_consistency_eager_view_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_acos_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_as_strided_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atanh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_atleast_1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_block_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cartesian_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_cfloat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_copysign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_diff_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_dist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_div_floor_rounding_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_ifft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_irfftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_index_reduce_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_inner_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_linalg_svdvals_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_logaddexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_mv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_avg_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_adaptive_max_pool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_interpolate_nearest_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool1d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_nn_functional_upsample_nearest_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_backward_pca_lowrank_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_polygamma_polygamma_n_2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_round_decimals_3_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sgn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_entr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_trapz_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_tril_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unbind_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsafe_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_var_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_backward_view_copy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___radd___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input___rmod___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input__unsafe_masked_index_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_all_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_baddbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_char_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cholesky_inverse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_cumsum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_digamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_dot_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_double_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_fft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_ihfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_flatten_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_fliplr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_float_power_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_gradient_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_grid_sampler_2d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hsplit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_i0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_igammac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_index_fill_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_lu_factor_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_linspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mT_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_masked_softmin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_multinomial_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_mvlgamma_mvlgamma_p_3_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_conv_transpose3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_nn_functional_unfold_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_outer_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polar_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_polygamma_polygamma_n_1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_randn_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_renorm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_resize__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_round_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_rsqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_select_scatter_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sign_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sinh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_laguerre_polynomial_l_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_list_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_triu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_consecutive_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unique_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_cow_input_unsqueeze_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rmul___cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad___rpow___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_arange_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_broadcast_to_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_byte_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_cov_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagflat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_diagonal_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erf_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_expand_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_fftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_flip_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_half_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_int_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_isclose_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linalg_vector_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_linspace_tensor_overload_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_normal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logaddexp2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_long_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_lu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_median_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_masked_normalize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_matrix_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_mvlgamma_mvlgamma_p_1_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanmedian_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nanquantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_native_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_ne_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_neg_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cosine_similarity_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_dropout_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_fractional_max_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_group_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_multilabel_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_nn_functional_relu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_reciprocal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_resolve_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_scatter_reduce_mean_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_signal_windows_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_bessel_y0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_i1e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_split_with_sizes_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_t_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_view_as_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_where_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_forward_ad_zeros_like_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_addmv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_alias_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_bucketize_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_conj_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diag_embed_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagflat_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_diagonal_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_erfinv_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_exp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_expand_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_fft_irfft2_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_frexp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_geometric_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_index_select_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isnan_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_isreal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_kthvalue_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lerp_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_lgamma_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cond_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_cross_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_linalg_vander_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logaddexp_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_logspace_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_masked_amin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_meshgrid_list_of_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_mul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_adaptive_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_batch_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_celu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_elu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_grid_sample_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_interpolate_nearest-exact_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_max_unpool3d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_nll_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_pdist_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_prelu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_selu_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_polygamma_polygamma_n_2_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_randn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rot90_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_rsub_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_scatter_add_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_select_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_signal_windows_hann_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_slice_scatter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_softmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_erfcx_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_i1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_sqrt_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_tensor_split_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_trunc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unfold_copy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_operator_unsafe_split_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_operator_var_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_H_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rdiv___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay___rmatmul___cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__chunk_cat_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay__native_batch_norm_legit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_acosh_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addbmm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_addcmul_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_angle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_atleast_3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_broadcast_tensors_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_cholesky_solve_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_contiguous_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_corrcoef_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_diag_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_fftn_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_ifftshift_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_fft_rfft_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_frac_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_full_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_grid_sampler_2d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_histc_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_hstack_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_index_reduce_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_isin_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_jiterator_unary_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_det_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_diagonal_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_householder_product_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_ldl_factor_ex_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_linalg_lstsq_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logical_and_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_logit_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_amax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_argmax_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_masked_std_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_max_reduction_no_dim_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_msort_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nanmean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_native_layer_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nextafter_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_adaptive_max_pool1d_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_avg_pool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_l1_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_linear_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_local_response_norm_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_logsigmoid_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_pool1d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool2d_grad_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_max_unpool3d_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_normal_in_place_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ones_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_permute_copy_cuda_float32, 
test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_prod_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_quantile_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_ravel_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_real_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_resize_as__cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_signal_windows_general_cosine_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_bessel_y1_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_i0e_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_log_ndtr_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_std_mean_unbiased_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_sum_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_to_sparse_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_torch_ops_aten__safe_softmax_default_cuda_float32, test/test_ops.py::TestCompositeComplianceCUDA::test_view_replay_var_mean_unbiased_cuda_float32, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs__conversions_long_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_any_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_as_strided_scatter_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_atleast_2d_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_clone_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_conj_physical_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_cumsum_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_fft_irfft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isclose_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_isnan_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linspace_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_neg_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_new_full_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_nn_functional_pairwise_distance_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_copy_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_permute_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_ravel_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_sgn_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_special_softmax_with_dtype_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_take_along_dim_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_tril_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_var_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_var_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_view_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_vstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view__refs_zeros_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_addmv_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_as_strided_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_chunk_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_corrcoef_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_diagonal_scatter_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_dstack_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_fft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_hfft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifft2_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fft_ifftshift_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_fliplr_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_index_fill_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cholesky_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_cholesky_ex_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_eigvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_ldl_factor_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_lstsq_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_matrix_rank_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_norm_subgradients_at_zero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linalg_svdvals_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_linspace_tensor_overload_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_log_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_logical_and_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_masked_cumprod_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nn_functional_pixel_shuffle_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_nonzero_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_norm_fro_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_normal_in_place_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_ones_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_pow_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_repeat_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_reshape_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_resize_as__cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rot90_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_rsub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_square_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_std_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_stft_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_sub_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_tensordot_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unfold_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_unsqueeze_cuda_complex64, 
test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_mean_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_var_mean_unbiased_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_view_as_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_conj_view_vsplit_cuda_complex64, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view___getitem___cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_cfloat_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs__conversions_float_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_partial_views_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_as_strided_scatter_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_asin_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_asinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_block_diag_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_chunk_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_column_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_contiguous_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_expand_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_fft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_ifft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_fft_irfft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_imag_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linalg_svdvals_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_linspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_log10_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logical_xor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logspace_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_pow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_randn_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_reshape_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_rsqrt_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_special_softmax_with_dtype_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_stft_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_transpose_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_var_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view__refs_vstack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_acosh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_addmm_decomposed_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_all_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atanh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_atleast_2d_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_block_diag_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_clone_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_column_stack_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_combinations_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_conj_physical_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_cross_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_eq_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_exp2_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_exp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_expand_as_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_float_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_full_like_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_inner_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_ldexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_lu_factor_ex_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_norm_subgradients_at_zero_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_pinv_singular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_slogdet_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_solve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_tensorsolve_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_linalg_vander_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logcumsumexp_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_logical_xor_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_lu_unpack_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_masked_prod_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_matmul_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_meshgrid_variadic_tensors_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_movedim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nanmean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_narrow_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_empty_strided_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_new_full_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_l1_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_circular_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_pad_reflect_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_tanhshrink_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_nn_functional_triplet_margin_loss_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_norm_fro_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sinh_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_sparse_sampled_addmm_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_std_mean_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_svd_lowrank_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_t_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_take_along_dim_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_to_sparse_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_transpose_copy_cuda_complex128, test/test_ops.py::TestMathBitsCUDA::test_neg_conj_view_zeros_cuda_complex128, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view___radd___cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_add_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_allclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_as_strided_partial_views_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_atleast_1d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_block_diag_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_cauchy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_clone_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_constant_pad_nd_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_copysign_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_count_nonzero_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_div_no_rounding_mode_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_fft_fftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_index_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_isclose_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_logical_or_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_meshgrid_list_of_tensors_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_new_zeros_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_hardshrink_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_prelu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_selu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_nn_functional_softmin_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_select_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_log_ndtr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_tril_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_triu_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_unsqueeze_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__refs_zeros_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view__upsample_bilinear2d_aa_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_amin_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_atan_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_cdouble_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_contiguous_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_diagonal_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_div_trunc_rounding_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_einsum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_eq_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_equal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_expand_as_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifft_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_fft_ifftshift_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_flip_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_full_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_gather_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ge_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_reduce_amax_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_index_select_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_jiterator_unary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cholesky_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_cross_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_diagonal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_eigvals_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_householder_product_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_lu_solve_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_matrix_rank_hermitian_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linalg_solve_triangular_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_linspace_tensor_overload_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_log_softmax_with_dtype_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logaddexp_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logit_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_logspace_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_masked_select_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_binary_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_max_reduction_no_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_maximum_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_median_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_min_reduction_with_dim_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_msort_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_mul_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nanquantile_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ne_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_full_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_new_ones_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_binary_cross_entropy_with_logits_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_conv_transpose3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_cosine_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_dropout3d_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_hinge_embedding_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_logsigmoid_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_mish_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_nn_functional_multilabel_soft_margin_loss_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_norm_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_ones_like_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_polygamma_polygamma_n_3_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_positive_cuda_float64, 
test/test_ops.py::TestMathBitsCUDA::test_neg_view_qr_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_reciprocal_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_resolve_neg_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_rsqrt_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_scalar_tensor_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_select_scatter_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sgn_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_sparse_mm_reduce_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_hermite_polynomial_he_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_special_modified_bessel_i0_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_t_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_to_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_torch_ops_aten__safe_softmax_default_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unflatten_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_unsqueeze_copy_cuda_float64, test/test_ops.py::TestMathBitsCUDA::test_neg_view_var_cuda_float64, test/test_ops.py::TestFakeTensorCUDA::test_fake_H_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake__batch_norm_with_update_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_argwhere_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_atleast_3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast___rmul___cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_abs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_addr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_any_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_argsort_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_as_strided_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_asinh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_atanh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_block_diag_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cauchy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cholesky_solve_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_column_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_complex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_constant_pad_nd_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumprod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_cumulative_trapezoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dsplit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_exponential_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_fft_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fft_ifftshift_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_fmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_imag_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_isneginf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_jiterator_2inputs_2outputs_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_det_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_eigh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_lu_factor_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_linalg_qr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logical_xor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_lt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_min_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_multinomial_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_binary_cross_entropy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_cosine_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_dropout3d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_huber_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_multi_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_pad_constant_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_softsign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_nn_functional_tanhshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_polygamma_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_resize_as__cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_select_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_bessel_y0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_hermite_polynomial_h_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_modified_bessel_k1_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_ndtr_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_special_zeta_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_split_with_sizes_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_sub_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_t_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_autocast_view_as_real_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bitwise_xor_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_fake_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_broadcast_shapes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_bucketize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cdouble_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ceil_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cosh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_T_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp__softmax_backward_data_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acos_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_acosh_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcdiv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_addcmul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_alias_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_broadcast_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_diagonal_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_einsum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_exp2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_expand_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_fft_hfftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_index_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_kthvalue_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cholesky_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_matrix_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_multi_dot_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_pinv_singular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_linalg_vecdot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_masked_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_maximum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nansum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_native_batch_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_alpha_dropout_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_avg_pool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_conv2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_hardsigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_interpolate_bilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_max_unpool2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_nn_functional_poisson_nll_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_normal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_permute_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_pow_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_reshape_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_round_decimals_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_amin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_scatter_reduce_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sigmoid_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_sin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_stack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_stft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_amp_tile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_add_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_addmm_decomposed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_atan_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_block_diag_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_bmm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cartesian_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_clamp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_copysign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_cummax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_div_no_rounding_mode_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_double_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_erfc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_expand_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_fft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_ihfft_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_fft_irfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_flipud_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_frexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_gather_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cond_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_cross_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_matrix_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_norm_subgradients_at_zero_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_linalg_pinv_hermitian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_logit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mH_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_masked_std_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_max_reduction_with_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_avg_pool3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_adaptive_max_pool1d_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_conv3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_leaky_relu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_margin_ranking_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_multilabel_soft_margin_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_normalize_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_scaled_dot_product_attention_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_nn_functional_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_reshape_as_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_rsqrt_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_scatter_reduce_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_slice_scatter_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i0e_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_i1e_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_squeeze_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_sum_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_svd_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_take_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_tensordot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_unfold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_var_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_view_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_crossref_backward_no_amp_vstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diag_embed_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diagonal_copy_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_diff_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_digamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_dstack_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_empty_like_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_hfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_ifft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_fft_irfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_flipud_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_fmod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_ge_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_geometric_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_histc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_index_reduce_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_diagonal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eig_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_eigvalsh_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_linalg_vander_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_logical_or_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_lu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_amax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_meshgrid_variadic_tensors_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nanmedian_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_conv_transpose2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_gelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_hardswish_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_instance_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_interpolate_trilinear_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_reflect_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_pad_replicate_negative_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_prelu_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_nn_functional_threshold_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_prod_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_quantile_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_repeat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_repeat_interleave_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_scatter_reduce_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_bartlett_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signal_windows_general_hamming_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_signbit_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_softmax_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_special_xlog1py_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_list_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_split_with_sizes_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_tril_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_true_divide_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_fake_var_mean_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops___rsub___cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops__segment_reduce_lengths_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_addmv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_angle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_bfloat16_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_chalf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_combinations_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_corrcoef_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_diagflat_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erf_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_erfinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_expm1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_hfft_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_ifftn_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_fft_rfft2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_float_power_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_floor_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_hypot_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_igammac_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_index_put_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_istft_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_4inputs_with_extra_args_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_jiterator_binary_return_by_ref_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_lgamma_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_eig_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_norm_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_pinv_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_slogdet_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_ex_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_linalg_solve_triangular_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log2_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logical_and_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_logaddexp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_masked_softmin_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_matrix_exp_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_max_binary_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_median_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_mul_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_ne_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout2d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_dropout3d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_embedding_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_hinge_embedding_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_layer_norm_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_max_unpool1d_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mish_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_mse_loss_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pad_replicate_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_nn_functional_pixel_unshuffle_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_fro_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_norm_nuc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_pca_lowrank_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_polygamma_polygamma_n_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_reciprocal_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_round_decimals_neg_3_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_select_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_short_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sign_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_signal_windows_hann_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_sinc_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_airy_ai_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_scaled_modified_bessel_k1_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_special_spherical_bessel_j0_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_copy_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_squeeze_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_std_unbiased_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_take_along_dim_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_tensor_split_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_to_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_torch_ops_aten__efficient_attention_forward_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unique_consecutive_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_pointwise_ops_unsafe_chunk_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_arange_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_linspace_tensor_overload_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_logspace_tensor_overload_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout__refs_ones_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_arange_cuda_int64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bfloat16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_bool, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_full_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_float16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_float32, 
test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_int16, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_linspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_complex128, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_logspace_tensor_overload_cuda_uint8, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_complex64, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_float32, test/test_ops.py::TestFakeTensorCUDA::test_strided_layout_zeros_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags___rmatmul___cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_abs_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_acos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_as_strided_partial_views_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_column_stack_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_conj_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_cumprod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_exp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_hfftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fft_ifftn_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_gt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_index_fill_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isnan_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_isposinf_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_istft_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags__refs_item_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_linalg_vector_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log10_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_log_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_native_layer_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_new_empty_strided_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_hardtanh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softmin_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_nn_functional_softshrink_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_normal_number_mean_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_positive_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_prod_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_select_scatter_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_bessel_j0_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_ndtri_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_xlog1py_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_special_zeta_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_sqrt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_squeeze_multiple_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_t_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unbind_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags__refs_unsqueeze_copy_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags__refs_view_as_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_acos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_arange_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_argmax_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_copy_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_as_strided_partial_views_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_asinh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_baddbmm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_byte_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cdist_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_char_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_combinations_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_complex_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_copysign_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cos_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_cummin_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_einsum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_fft_irfft2_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_float_power_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_gcd_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_gt_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_hypot_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_lcm_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_le_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cholesky_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_cond_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_det_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_diagonal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigh_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_eigvalsh_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_linalg_ldl_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_lu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_rank_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_linalg_matrix_rank_hermitian_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_log_softmax_with_dtype_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_logical_xor_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mH_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_logaddexp_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_masked_sum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_maximum_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_msort_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_mv_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_native_dropout_backward_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_adaptive_max_pool3d_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_celu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_dropout_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_embedding_bag_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_feature_alpha_dropout_with_train_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_glu_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_interpolate_bicubic_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_pad_circular_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_smooth_l1_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_soft_margin_loss_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_threshold_cuda_float32, 
test/test_ops.py::TestTagsCUDA::test_tags_nn_functional_upsample_bilinear_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_norm_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_ones_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polar_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_polygamma_polygamma_n_4_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reciprocal_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_repeat_interleave_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_reshape_as_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resize__cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_resolve_neg_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_round_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_rsub_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_searchsorted_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_sinc_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_erfcx_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_special_modified_bessel_i1_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_squeeze_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_t_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_take_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_topk_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_transpose_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_triangular_solve_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_triu_indices_cuda_int64, test/test_ops.py::TestTagsCUDA::test_tags_unflatten_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_var_mean_unbiased_cuda_float32, test/test_ops.py::TestTagsCUDA::test_tags_view_as_real_cuda_complex64, test/test_ops.py::TestTagsCUDA::test_tags_xlogy_cuda_float32 2024-11-01T17:29:05.2337605Z 2024-11-01T17:29:05.2337908Z Running test_unary_ufuncs 1/1 ... 
[2024-11-01 17:29:04.980822] 2024-11-01T17:29:05.2338429Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:29:05.2339720Z Executing ['/opt/conda/envs/py_3.10/bin/python', '-bb', 'test_unary_ufuncs.py', '-m', 'not serial', '--shard-id=1', '--num-shards=1', '-v', '-vv', '-rfEX', '-p', 'no:xdist', '--use-pytest', '-x', '--reruns=2', '--import-slow-tests', '--import-disabled-tests'] ... [2024-11-01 17:29:04.981403] 2024-11-01T17:31:38.2879296Z 2024-11-01T17:31:38.2885570Z test_binary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_binary_ufuncs_1.1_82f345029025d0e9_.log 2024-11-01T17:31:38.7600468Z Running 12850 items in this shard: test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___add___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___and___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___eq___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___floordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ge___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___gt___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iadd___not_implemented_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___iand___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ifloordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ilshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___imul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ior___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ipow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___irshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___isub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___itruediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ixor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___le___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lshift___not_implemented_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___lt___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___matmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___mul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ne___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___or___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___pow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___radd___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rand___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rfloordiv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rlshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmatmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmod___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rmul___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___ror___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rpow___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rrshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rshift___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rsub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rtruediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___rxor___not_implemented_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___sub___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___truediv___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test___xor___not_implemented_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_broadcast_empty_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_add_with_tail_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_addcmul_scalars_as_floats_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_addsub_half_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_atan2_edgecases_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___radd___cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_float_power_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_igammac_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_clamp_min_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_laguerre_polynomial_l_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_t_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_v_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_xlog1py_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_batch_vs_slicing_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_op_mem_overlap_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_op_scalar_device_unspecified_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_binary_ops_with_scalars_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bitwise_ops_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_bool_tensor_comparison_ops_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_eq_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_or_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_isclose_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_min_binary_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_broadcasting_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cdiv_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cmul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex128_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_comparison_ops_type_promotion_and_broadcasting_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_div_underflow_overflow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_div_underflow_overflow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_complex_scalar_pow_tensor_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmax_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_isclose_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_xlog1py_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_clamp_min_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lcm_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_shifted_chebyshev_polynomial_w_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_large_dim_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_minimum_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_laguerre_polynomial_l_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_size1_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igamma_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_remainder_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_true_divide_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_no_rounding_mode_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lcm_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_every_other_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_complex_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_floor_rounding_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igamma_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_lt_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_maximum_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_minimum_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_contig_vs_transposed_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_copysign_subgradient_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cpow_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cpu_tensor_pow_cuda_scalar_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cremainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cross_device_binary_ops_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cross_device_inplace_error_msg_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_csub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cuda_tensor_pow_scalar_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_cumulative_trapezoid_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_and_floordiv_script_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_and_floordiv_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_modes_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_nonfinite_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_div_rounding_numpy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divide_by_zero_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_divmul_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_power_exceptions_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_scalar_pow_float_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_float_scalar_pow_float_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_div_extremal_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_scalar_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_tensor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_floor_divide_zero_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_float_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_by_zero_integral_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_fmod_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_complex_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cross_device_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_heaviside_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_idiv_and_ifloordiv_vs_python_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_inplace_division_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_inplace_dunders_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_int_and_float_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_int_tensor_pow_neg_ints_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_ldexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cpu_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cpu_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_lerp_lowp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_complex64_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_and_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_or_cuda_uint8_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_logical_xor_with_nontrivial_alignment_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_long_tensor_pow_floats_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float16_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_and_minimum_subgradient_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex128_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_complex_cuda_complex64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_cross_device_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_float_nan_and_inf_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_forward_ad_float32_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_int_and_bool_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bfloat16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_maximum_minimum_type_promotion_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_min_max_binary_op_nan_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_chalf_tensor_and_cpu_scalar_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_mul_intertype_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_muldiv_scalar_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_bfloat16_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs__conversions_polar_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logaddexp_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_rsub_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs__conversions_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igamma_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lcm_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_copysign_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_legendre_polynomial_p_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_u_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_shifted_chebyshev_polynomial_w_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_expand_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmax_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___radd___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rand___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rdiv___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmod___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rmul___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___ror___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rpow___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rsub___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index___rxor___cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs__conversions_polar_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index__refs_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_atan2_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_left_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_right_shift_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_complex_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_copysign_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_floor_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_no_rounding_mode_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_div_trunc_rounding_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmax_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmin_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_hypot_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igamma_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_igammac_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_pow_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_rsub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_index_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_jiterator_binary_return_by_ref_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ldexp_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logaddexp_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_logical_xor_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_mul_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_nextafter_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_polar_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_rsub_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_v_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_h_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_hermite_polynomial_he_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_laguerre_polynomial_l_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_legendre_polynomial_p_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_t_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_u_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_v_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_shifted_chebyshev_polynomial_w_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_xlog1py_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_special_zeta_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_true_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_non_contig_xlogy_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___radd___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rdiv___cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rmod___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rmul___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rpow___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable___rsub___cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs__conversions_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs__conversions_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmax_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmin_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_pow_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable__refs_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_atan2_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_complex_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_copysign_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_floor_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_no_rounding_mode_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_div_trunc_rounding_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmax_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmin_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_hypot_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_igamma_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_igammac_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ldexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logaddexp_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_maximum_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_mul_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_nextafter_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_polar_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_rsub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_t_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_hermite_polynomial_h_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_hermite_polynomial_he_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_laguerre_polynomial_l_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_legendre_polynomial_p_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_t_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_u_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_v_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_shifted_chebyshev_polynomial_w_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_xlog1py_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_special_zeta_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_true_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_not_broadcastable_xlogy_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_out_resize_warning_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex_extremal_failing_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_complex_extremal_failing_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_inplace_resizing_exception_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_base_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_overloads_mem_overlap_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_pow_scalar_type_promotion_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rdiv_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_eq_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_remainder_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_extremal_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_remainder_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_add_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_complex128, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_large_values_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_heaviside_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_ne_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values__refs_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_add_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_bitwise_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_max_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_clamp_min_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_eq_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_float_power_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_floor_divide_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_fmod_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gcd_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ge_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_gt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_heaviside_cuda_uint8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_isclose_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_jiterator_binary_return_by_ref_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lcm_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_int8, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_le_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_and_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_or_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_logical_xor_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_lt_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_max_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_maximum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_min_binary_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_minimum_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_bfloat16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_ne_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_pow_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_remainder_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_small_values_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_reference_numerics_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_fmod_large_dividend_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_fmod_large_dividend_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_remainder_overflow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_rpow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_floor_divide_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_fmod_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_ne_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support__refs_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_add_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_bitwise_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_max_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_max_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_min_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_clamp_min_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_eq_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_float_power_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_floor_divide_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_floor_divide_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_fmod_cuda_float32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_fmod_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gcd_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ge_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ge_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_gt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_heaviside_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_heaviside_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_isclose_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_jiterator_binary_return_by_ref_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lcm_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_le_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_le_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_complex64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_and_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_or_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_logical_xor_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lt_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_lt_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_max_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_max_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_maximum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_maximum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_min_binary_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_min_binary_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_minimum_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_minimum_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_ne_cuda_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_pow_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_remainder_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_remainder_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_scalar_support_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_shift_limits_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_signed_shift_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_complex128, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_complex64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_float64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_cuda_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_sub_typing_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_tensor_pow_tensor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_trapezoid_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_true_divide_out_cuda_bfloat16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_true_divide_out_cuda_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___radd___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rand___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rdiv___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rmod___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rmul___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___ror___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rpow___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rsub___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion___rxor___cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs__conversions_complex_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs__conversions_polar_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_and_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_left_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_right_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_bitwise_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_clamp_max_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_clamp_min_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_copysign_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_floor_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_no_rounding_mode_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_div_trunc_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_eq_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_float_power_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_floor_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmax_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmin_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_fmod_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_gcd_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_ge_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_gt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_heaviside_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_hypot_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_igamma_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_igammac_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_isclose_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_lcm_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_le_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logaddexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_logical_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_lt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_maximum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_minimum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_ne_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_nextafter_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_remainder_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_rsub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_special_xlog1py_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_special_zeta_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_sub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_true_divide_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion__refs_xlogy_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_add_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_atan2_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_left_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_right_shift_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_bitwise_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_clamp_max_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_clamp_min_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_complex_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_copysign_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_floor_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_no_rounding_mode_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_div_trunc_rounding_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_eq_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_float_power_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_floor_divide_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmax_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmin_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_fmod_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_gcd_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ge_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_gt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_heaviside_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_hypot_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_igamma_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_igammac_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_isclose_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_jiterator_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_jiterator_binary_return_by_ref_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_lcm_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ldexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_le_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logaddexp_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_and_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_or_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_logical_xor_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_lt_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_max_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_maximum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_min_binary_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_minimum_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_mul_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_ne_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_nextafter_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_polar_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_pow_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_remainder_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_rsub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_t_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_u_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_v_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_chebyshev_polynomial_w_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_hermite_polynomial_h_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_hermite_polynomial_he_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_laguerre_polynomial_l_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_legendre_polynomial_p_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_t_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_u_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_v_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_shifted_chebyshev_polynomial_w_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_xlog1py_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_special_zeta_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_sub_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_true_divide_cuda, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_type_promotion_xlogy_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_bfloat16_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_bool, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_cuda_uint8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_gradients_cuda_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_xlogy_xlog1py_scalar_type_promotion_cuda, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int16, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_bool_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_float64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int32, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int16_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int32_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int64_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int64, 
test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_int8_uint8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_bool, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_float32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_float64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int16, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int32, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int64, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_int8, test/test_binary_ufuncs.py::TestBinaryUfuncsCUDA::test_zeta_cuda_uint8_uint8 2024-11-01T17:31:39.1892267Z 2024-11-01T17:33:56.7104991Z 2024-11-01T17:33:56.7106709Z test_unary_ufuncs 1/1 was successful, full logs can be found in artifacts with path test/test-reports/test_unary_ufuncs_1.1_4479d36d520a2223_.log 2024-11-01T17:33:57.5999883Z Running 25067 items in this shard: test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_angle_complex_to_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_big_number_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_signed_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_abs_zero_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_byte_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_half_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_floor_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_imag_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_rsqrt_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_tanh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_abs_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_float_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardsigmoid_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softplus_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_nn_functional_threshold_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_round_decimals_neg_3_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_bessel_y1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_scaled_modified_bessel_k1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_batch_vs_slicing_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_complex_edge_values_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cdouble_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_deg2rad_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_exp_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log10_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_logit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_special_spherical_bessel_j0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_conj_physical_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_digamma_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_floor_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_char_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cos_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isneginf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log1p_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acos_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_floor_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isnan_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isposinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu6_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_3_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_large_dim_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_mvlgamma_mvlgamma_p_5_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_neg_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_rrelu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_short_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sign_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sin_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_size1_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_byte_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_int_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_expm1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_i0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_celu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_i1e_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_logit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isfinite_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_long_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_3_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nan_to_num_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_elu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_short_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_entr_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_every_other_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_float_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isfinite_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_isreal_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_positive_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_bessel_j1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_erfcx_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_jiterator_unary_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_mvlgamma_mvlgamma_p_5_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_softsign_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_contig_vs_transposed_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_digamma_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_exp_slow_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_float_domains_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_assert_raises_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardshrink_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_backward_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_hardswish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_range3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_i0_special_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igamma_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_common_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_igammac_edge_cases_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_isposinf_isneginf_non_boolean_output_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_log1p_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_bfloat16_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_abs_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu6_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sgn_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_cosh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sgn_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_special_spherical_bessel_j0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_fill_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_floor_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_jiterator_unary_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log10_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_rrelu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_3_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_scaled_modified_bessel_k1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expand_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_half_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_bool_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_float_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_half_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs__conversions_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isinf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_hardtanh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sigmoid_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bfloat16_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_bool_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_byte_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cdouble_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cfloat_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_chalf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_char_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_double_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_float_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_half_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_celu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_positive_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sigmoid_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_index_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_int_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isfinite_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_long_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_hardtanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu6_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_rrelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_nn_functional_threshold_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_short_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_non_contig_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_nonzero_empty_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_op_invert_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_polygamma_neg_cuda, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_mish_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_softplus_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_positive_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_log_ndtr_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_cosh_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_expm1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_mvlgamma_mvlgamma_p_5_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_elu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softplus_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_nn_functional_threshold_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_3_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sgn_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_j1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_extremal_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_abs_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cos_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_tanhshrink_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinc_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_atanh_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_mish_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_selu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_special_spherical_bessel_j0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_large_trunc_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_conj_physical_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_erfinv_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sign_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_conj_physical_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_log_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_tanhshrink_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sigmoid_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_signbit_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_normal_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asin_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frac_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isneginf_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_log_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_celu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_mish_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_selu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_multigammaln_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_special_spherical_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_square_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small__refs_trunc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_abs_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_acosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_angle_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asin_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_asinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_atanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_bitwise_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_ceil_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_conj_physical_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cos_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_cosh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_deg2rad_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_digamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_erfinv_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_exp_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_expm1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_fill_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_floor_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frac_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_frexp_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_imag_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isfinite_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isnan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isneginf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isposinf_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_complex64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_isreal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_jiterator_unary_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_lgamma_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log10_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log1p_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_log_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logical_not_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_logit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_mvlgamma_mvlgamma_p_5_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nan_to_num_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_neg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_celu_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_elu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_hardsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_logsigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_mish_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_prelu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_relu_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_selu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softplus_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_softsign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_tanhshrink_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_nn_functional_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_2_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_3_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_polygamma_polygamma_n_4_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_positive_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rad2deg_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_real_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_reciprocal_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_round_decimals_neg_3_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_rsqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sgn_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sigmoid_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sign_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_signbit_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex128, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sin_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinc_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sinh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_airy_ai_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_int8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_j1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_bessel_y1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_entr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_erfcx_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i0e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bfloat16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_i1e_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_log_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_i1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float32, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtr_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_ndtri_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float16, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_polygamma_special_polygamma_n_0_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_scaled_modified_bessel_k1_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_special_spherical_bessel_j0_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_sqrt_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_float64, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_square_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_bool, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tan_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_bool, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_tanh_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_reference_numerics_small_trunc_cuda_uint8, 
test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_complex_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_silu_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_sinc_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_bfloat16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_i0_i1_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_log_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_special_ndtr_vs_scipy_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex128, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_complex64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_float64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int16, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int32, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int64, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_int8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_threshold_cuda_uint8, test/test_unary_ufuncs.py::TestUnaryUfuncsCUDA::test_unary_out_op_mem_overlap_cuda_float64 2024-11-01T17:33:58.4402906Z 2024-11-01T17:33:58.4403224Z GITHUB_RUN_ID, GITHUB_RUN_ATTEMPT, or 
ARTIFACTS_FILE_SUFFIX not set, not uploading 2024-11-01T17:33:58.4403749Z Uploading artifacts took 0.00 seconds 2024-11-01T17:33:58.4404445Z Running test batch 'tests to run' cost 2517.78 seconds 2024-11-01T17:33:59.8888405Z 2024-11-01T17:33:59.8888968Z real 42m5.052s 2024-11-01T17:33:59.8889431Z user 266m10.464s 2024-11-01T17:33:59.8889787Z sys 44m49.105s 2024-11-01T17:33:59.8890118Z + assert_git_not_dirty 2024-11-01T17:33:59.8897498Z + [[ linux-focal-rocm6.2-py3.10 != *rocm* ]] 2024-11-01T17:33:59.8898052Z + test_aten 2024-11-01T17:33:59.8898738Z + echo 'Running ATen tests with pytorch lib' 2024-11-01T17:33:59.8903034Z Running ATen tests with pytorch lib 2024-11-01T17:33:59.8903569Z + [[ -n '' ]] 2024-11-01T17:33:59.8903985Z + echo 'Running test with the build folder' 2024-11-01T17:33:59.8904459Z Running test with the build folder 2024-11-01T17:33:59.8911877Z + TEST_BASE_DIR=build/bin 2024-11-01T17:33:59.8912956Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libc10_hip.so build/bin 2024-11-01T17:33:59.8943619Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libcaffe2_nvrtc.so build/bin 2024-11-01T17:33:59.8958751Z + ln -sf '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libmkldnn*' build/bin 2024-11-01T17:33:59.8972769Z + ln -sf '/opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libnccl*' build/bin 2024-11-01T17:33:59.8995309Z + ln -sf /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_cpu.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_global_deps.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_hip.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorch_python.so /opt/conda/envs/py_3.10/lib/python3.10/site-packages/torch/lib/libtorchbind_test.so 
build/bin 2024-11-01T17:33:59.9007606Z + ls build/bin 2024-11-01T17:33:59.9045508Z BackoffTest 2024-11-01T17:33:59.9045811Z CMakeFiles 2024-11-01T17:33:59.9046142Z CTestTestfile.cmake 2024-11-01T17:33:59.9046483Z CppSignature_test 2024-11-01T17:33:59.9046810Z Dict_test 2024-11-01T17:33:59.9047164Z Dimname_test 2024-11-01T17:33:59.9047538Z FileStoreTest 2024-11-01T17:33:59.9047902Z HashStoreTest 2024-11-01T17:33:59.9048262Z IListRef_test 2024-11-01T17:33:59.9048617Z KernelFunction_test 2024-11-01T17:33:59.9048957Z List_test 2024-11-01T17:33:59.9049241Z MaybeOwned_test 2024-11-01T17:33:59.9049549Z NamedTensor_test 2024-11-01T17:33:59.9049870Z ProcessGroupGlooTest 2024-11-01T17:33:59.9050216Z StorageUtils_test 2024-11-01T17:33:59.9050523Z TCPStoreTest 2024-11-01T17:33:59.9051721Z aot_model_compiler_test 2024-11-01T17:33:59.9052076Z apply_utils_test 2024-11-01T17:33:59.9052402Z atest 2024-11-01T17:33:59.9052686Z backend_fallback_test 2024-11-01T17:33:59.9053021Z basic 2024-11-01T17:33:59.9053296Z broadcast_test 2024-11-01T17:33:59.9053589Z c10_Bitset_test 2024-11-01T17:33:59.9053929Z c10_CompileTimeFunctionPointer_test 2024-11-01T17:33:59.9054323Z c10_ConstexprCrc_test 2024-11-01T17:33:59.9054611Z c10_DeadlockDetection_test 2024-11-01T17:33:59.9054909Z c10_DeviceGuard_test 2024-11-01T17:33:59.9062882Z c10_Device_test 2024-11-01T17:33:59.9063147Z c10_DispatchKeySet_test 2024-11-01T17:33:59.9063478Z c10_Half_test 2024-11-01T17:33:59.9063733Z c10_InlineDeviceGuard_test 2024-11-01T17:33:59.9064032Z c10_InlineStreamGuard_test 2024-11-01T17:33:59.9064315Z c10_LeftRight_test 2024-11-01T17:33:59.9064590Z c10_Metaprogramming_test 2024-11-01T17:33:59.9064878Z c10_NetworkFlow_test 2024-11-01T17:33:59.9065138Z c10_Scalar_test 2024-11-01T17:33:59.9065408Z c10_SizesAndStrides_test 2024-11-01T17:33:59.9065683Z c10_StreamGuard_test 2024-11-01T17:33:59.9065947Z c10_SymInt_test 2024-11-01T17:33:59.9066212Z c10_Synchronized_test 2024-11-01T17:33:59.9066677Z c10_ThreadLocal_test 
2024-11-01T17:33:59.9067069Z c10_TypeIndex_test 2024-11-01T17:33:59.9067330Z c10_TypeList_test 2024-11-01T17:33:59.9067585Z c10_TypeTraits_test 2024-11-01T17:33:59.9067849Z c10_accumulate_test 2024-11-01T17:33:59.9068110Z c10_bfloat16_test 2024-11-01T17:33:59.9068360Z c10_bit_cast_test 2024-11-01T17:33:59.9068620Z c10_complex_math_test 2024-11-01T17:33:59.9068886Z c10_complex_test 2024-11-01T17:33:59.9069132Z c10_cow_test 2024-11-01T17:33:59.9069373Z c10_exception_test 2024-11-01T17:33:59.9069629Z c10_flags_test 2024-11-01T17:33:59.9069875Z c10_generic_math_test 2024-11-01T17:33:59.9070163Z c10_hip_HIPAssertionsTest_1_var_test 2024-11-01T17:33:59.9070523Z c10_hip_HIPAssertionsTest_catches_stream 2024-11-01T17:33:59.9070996Z c10_hip_HIPAssertionsTest_catches_thread_and_block_and_device 2024-11-01T17:33:59.9071450Z c10_hip_HIPAssertionsTest_from_2_processes 2024-11-01T17:33:59.9072018Z c10_hip_HIPAssertionsTest_multiple_writes_from_blocks_and_threads 2024-11-01T17:33:59.9072548Z c10_hip_HIPAssertionsTest_multiple_writes_from_multiple_blocks 2024-11-01T17:33:59.9073044Z c10_hip_HIPAssertionsTest_multiple_writes_from_same_block 2024-11-01T17:33:59.9073446Z c10_hip_HIPTest 2024-11-01T17:33:59.9073754Z c10_intrusive_ptr_benchmark 2024-11-01T17:33:59.9074100Z c10_intrusive_ptr_test 2024-11-01T17:33:59.9074421Z c10_irange_test 2024-11-01T17:33:59.9074719Z c10_lazy_test 2024-11-01T17:33:59.9074999Z c10_logging_test 2024-11-01T17:33:59.9075297Z c10_optional_test 2024-11-01T17:33:59.9075614Z c10_ordered_preserving_dict_test 2024-11-01T17:33:59.9075954Z c10_registry_test 2024-11-01T17:33:59.9076219Z c10_small_vector_test 2024-11-01T17:33:59.9076485Z c10_ssize_test 2024-11-01T17:33:59.9076735Z c10_string_util_test 2024-11-01T17:33:59.9077000Z c10_string_view_test 2024-11-01T17:33:59.9077284Z c10_tempfile_test 2024-11-01T17:33:59.9077534Z c10_typeid_test 2024-11-01T17:33:59.9077800Z cmake_install.cmake 2024-11-01T17:33:59.9078117Z cpu_allocator_test 2024-11-01T17:33:59.9078419Z 
cpu_generator_test 2024-11-01T17:33:59.9078745Z cpu_profiling_allocator_test 2024-11-01T17:33:59.9079081Z cpu_rng_test 2024-11-01T17:33:59.9079366Z dispatch_key_set_test 2024-11-01T17:33:59.9079668Z dlconvertor_test 2024-11-01T17:33:59.9079929Z example_allreduce 2024-11-01T17:33:59.9080194Z extension_backend_test 2024-11-01T17:33:59.9080468Z half_test 2024-11-01T17:33:59.9080709Z hip_apply_test 2024-11-01T17:33:59.9080964Z hip_complex_math_test 2024-11-01T17:33:59.9081244Z hip_complex_test 2024-11-01T17:33:59.9081504Z hip_distributions_test 2024-11-01T17:33:59.9081783Z hip_dlconvertor_test 2024-11-01T17:33:59.9082048Z hip_generator_test 2024-11-01T17:33:59.9082303Z hip_half_test 2024-11-01T17:33:59.9082554Z hip_integer_divider_test 2024-11-01T17:33:59.9082839Z hip_optional_test 2024-11-01T17:33:59.9083113Z hip_packedtensoraccessor_test 2024-11-01T17:33:59.9083421Z hip_vectorized_test 2024-11-01T17:33:59.9083685Z inline_container_test 2024-11-01T17:33:59.9083959Z ivalue_test 2024-11-01T17:33:59.9084215Z kernel_function_legacy_test 2024-11-01T17:33:59.9084508Z kernel_function_test 2024-11-01T17:33:59.9084783Z kernel_lambda_legacy_test 2024-11-01T17:33:59.9085063Z kernel_lambda_test 2024-11-01T17:33:59.9085327Z kernel_stackbased_test 2024-11-01T17:33:59.9085592Z lazy_tensor_test 2024-11-01T17:33:59.9085842Z legacy_vmap_test 2024-11-01T17:33:59.9086179Z libc10.so 2024-11-01T17:33:59.9086415Z libc10_hip.so 2024-11-01T17:33:59.9086657Z libcaffe2_nvrtc.so 2024-11-01T17:33:59.9086974Z 'libmkldnn*' 2024-11-01T17:33:59.9087224Z 'libnccl*' 2024-11-01T17:33:59.9087461Z libtorch.so 2024-11-01T17:33:59.9087693Z libtorch_cpu.so 2024-11-01T17:33:59.9087953Z libtorch_global_deps.so 2024-11-01T17:33:59.9088241Z libtorch_hip.so 2024-11-01T17:33:59.9088487Z libtorch_python.so 2024-11-01T17:33:59.9088743Z libtorchbind_test.so 2024-11-01T17:33:59.9089032Z make_boxed_from_unboxed_functor_test 2024-11-01T17:33:59.9089368Z math_kernel_test 2024-11-01T17:33:59.9089694Z memory_format_test 
2024-11-01T17:33:59.9090021Z memory_overlapping_test 2024-11-01T17:33:59.9090318Z mobile_memory_cleanup 2024-11-01T17:33:59.9090593Z native_test 2024-11-01T17:33:59.9090830Z op_allowlist_test 2024-11-01T17:33:59.9091099Z op_registration_test 2024-11-01T17:33:59.9091359Z operator_name_test 2024-11-01T17:33:59.9091620Z operators_test 2024-11-01T17:33:59.9091899Z packedtensoraccessor_test 2024-11-01T17:33:59.9092206Z parallel_benchmark 2024-11-01T17:33:59.9092462Z pow_test 2024-11-01T17:33:59.9092698Z protoc 2024-11-01T17:33:59.9092963Z protoc-3.13.0.0 2024-11-01T17:33:59.9093214Z quantized_test 2024-11-01T17:33:59.9093460Z reduce_ops_test 2024-11-01T17:33:59.9093734Z reportMemoryUsage_test 2024-11-01T17:33:59.9094047Z scalar_tensor_test 2024-11-01T17:33:59.9094317Z scalar_test 2024-11-01T17:33:59.9094567Z static_runtime_bench 2024-11-01T17:33:59.9094908Z static_runtime_test 2024-11-01T17:33:59.9095182Z stride_properties_test 2024-11-01T17:33:59.9095537Z tensor_iterator_test 2024-11-01T17:33:59.9095798Z test_api 2024-11-01T17:33:59.9096030Z test_cpp_rpc 2024-11-01T17:33:59.9096294Z test_dist_autograd 2024-11-01T17:33:59.9096559Z test_edge_op_registration 2024-11-01T17:33:59.9096843Z test_jit 2024-11-01T17:33:59.9097070Z test_lazy 2024-11-01T17:33:59.9097297Z test_mobile_nnc 2024-11-01T17:33:59.9097543Z test_parallel 2024-11-01T17:33:59.9097785Z test_tensorexpr 2024-11-01T17:33:59.9098026Z thread_init_test 2024-11-01T17:33:59.9098390Z torch_shm_manager 2024-11-01T17:33:59.9098660Z tutorial_tensorexpr 2024-11-01T17:33:59.9098920Z type_ptr_test 2024-11-01T17:33:59.9099162Z type_test 2024-11-01T17:33:59.9099397Z undefined_tensor_test 2024-11-01T17:33:59.9099668Z vec_test_all_types_AVX2 2024-11-01T17:33:59.9099946Z vec_test_all_types_AVX512 2024-11-01T17:33:59.9100235Z vec_test_all_types_DEFAULT 2024-11-01T17:33:59.9100524Z verify_api_visibility 2024-11-01T17:33:59.9100789Z weakref_test 2024-11-01T17:33:59.9101035Z wrapdim_test 2024-11-01T17:33:59.9101276Z xla_tensor_test 
2024-11-01T17:33:59.9101560Z + aten/tools/run_tests.sh build/bin 2024-11-01T17:33:59.9101906Z + set -e 2024-11-01T17:33:59.9102161Z ++ dirname aten/tools/run_tests.sh 2024-11-01T17:33:59.9102577Z + VALGRIND_SUP=/var/lib/jenkins/pytorch/aten/tools/valgrind.sup 2024-11-01T17:33:59.9103012Z + export CPP_TESTS_DIR=build/bin 2024-11-01T17:33:59.9103339Z + CPP_TESTS_DIR=build/bin 2024-11-01T17:33:59.9103609Z + VALGRIND=OFF 2024-11-01T17:33:59.9105430Z + python test/run_test.py --cpp --verbose -i cpp/basic cpp/atest cpp/scalar_test cpp/broadcast_test cpp/wrapdim_test cpp/apply_utils_test cpp/dlconvertor_test cpp/native_test cpp/scalar_tensor_test cpp/undefined_tensor_test cpp/extension_backend_test cpp/lazy_tensor_test cpp/tensor_iterator_test cpp/Dimname_test cpp/Dict_test cpp/NamedTensor_test cpp/cpu_generator_test cpp/legacy_vmap_test cpp/operators_test 2024-11-01T17:33:59.9950041Z /var/lib/jenkins/pytorch/test/run_test.py:21: DeprecationWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html 2024-11-01T17:33:59.9951191Z import pkg_resources 2024-11-01T17:34:05.1122369Z Downloading https://ossci-metrics.s3.amazonaws.com/disabled-tests-condensed.json to /var/lib/jenkins/pytorch/test/.pytorch-disabled-tests.json 2024-11-01T17:34:05.1222172Z Found test times from artifacts 2024-11-01T17:34:05.1649414Z Found test times from artifacts 2024-11-01T17:34:05.1662988Z Running all tests 2024-11-01T17:34:05.1666158Z Running parallel tests on 2 processes 2024-11-01T17:34:05.1667151Z Name: tests to run (est. 
time: 0.0min) 2024-11-01T17:34:05.1667960Z Serial tests (0): 2024-11-01T17:34:05.1668632Z Parallel tests (19): 2024-11-01T17:34:05.1669285Z cpp/Dict_test 1/1 2024-11-01T17:34:05.1669943Z cpp/Dimname_test 1/1 2024-11-01T17:34:05.1670662Z cpp/NamedTensor_test 1/1 2024-11-01T17:34:05.1671353Z cpp/apply_utils_test 1/1 2024-11-01T17:34:05.1672035Z cpp/atest 1/1 2024-11-01T17:34:05.1672654Z cpp/basic 1/1 2024-11-01T17:34:05.1673562Z cpp/broadcast_test 1/1 2024-11-01T17:34:05.1674618Z cpp/cpu_generator_test 1/1 2024-11-01T17:34:05.1675507Z cpp/dlconvertor_test 1/1 2024-11-01T17:34:05.1676415Z cpp/extension_backend_test 1/1 2024-11-01T17:34:05.1677315Z cpp/lazy_tensor_test 1/1 2024-11-01T17:34:05.1678119Z cpp/legacy_vmap_test 1/1 2024-11-01T17:34:05.1678966Z cpp/native_test 1/1 2024-11-01T17:34:05.1679695Z cpp/operators_test 1/1 2024-11-01T17:34:05.1680135Z cpp/scalar_tensor_test 1/1 2024-11-01T17:34:05.1680579Z cpp/scalar_test 1/1 2024-11-01T17:34:05.1680943Z cpp/tensor_iterator_test 1/1 2024-11-01T17:34:05.1681327Z cpp/undefined_tensor_test 1/1 2024-11-01T17:34:05.1681729Z cpp/wrapdim_test 1/1 2024-11-01T17:34:05.1682089Z Name: excluded (est. time: 0.0min) 2024-11-01T17:34:05.1685670Z Serial tests (0): 2024-11-01T17:34:05.1686115Z Parallel tests (0): 2024-11-01T17:34:05.1723738Z Running cpp/Dict_test 1/1 ... [2024-11-01 17:34:05.171968] 2024-11-01T17:34:05.1725238Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:05.1730021Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/Dict_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-bfedbfe7c4e82891.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:34:05.172485] 2024-11-01T17:34:09.1464954Z 2024-11-01T17:34:09.1467163Z cpp/Dict_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dict_test_1.1_52c396911156563d_.log 2024-11-01T17:34:09.1468724Z 2024-11-01T17:34:09.1469321Z Running cpp/Dimname_test 1/1 ... [2024-11-01 17:34:09.146087] 2024-11-01T17:34:09.1470357Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:09.1473537Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/Dimname_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-a002acf42039356d.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:09.146585] 2024-11-01T17:34:12.5210434Z 2024-11-01T17:34:12.5212974Z cpp/Dimname_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dimname_test_1.1_20df4f433d6f86bd_.log 2024-11-01T17:34:12.5214602Z 2024-11-01T17:34:12.5215265Z Running cpp/NamedTensor_test 1/1 ... [2024-11-01 17:34:12.520592] 2024-11-01T17:34:12.5216376Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:12.5220526Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/NamedTensor_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-760395aaa5b6eacb.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:12.521226] 2024-11-01T17:34:15.8956056Z 2024-11-01T17:34:15.8958431Z cpp/NamedTensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.NamedTensor_test_1.1_ff1e9352e60092d8_.log 2024-11-01T17:34:15.8960129Z 2024-11-01T17:34:15.8960948Z Running cpp/apply_utils_test 1/1 ... 
[2024-11-01 17:34:15.895252] 2024-11-01T17:34:15.8962226Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:15.8966507Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/apply_utils_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-427ddb6bfc5047a2.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:15.895908] 2024-11-01T17:34:19.2703059Z 2024-11-01T17:34:19.2705652Z cpp/apply_utils_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.apply_utils_test_1.1_d8814d8534c44227_.log 2024-11-01T17:34:19.2707342Z 2024-11-01T17:34:19.2707889Z Running cpp/atest 1/1 ... [2024-11-01 17:34:19.269883] 2024-11-01T17:34:19.2708874Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:19.2720703Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/atest', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-fa7e62d39dd24058.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:19.270524] 2024-11-01T17:34:22.6954520Z 2024-11-01T17:34:22.6956773Z cpp/atest 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.atest_1.1_542802cc25f226db_.log 2024-11-01T17:34:22.6958353Z 2024-11-01T17:34:22.6958901Z Running cpp/basic 1/1 ... [2024-11-01 17:34:22.695060] 2024-11-01T17:34:22.6959867Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:22.6963855Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/basic', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-fd64c6402a32293b.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:34:22.695657] 2024-11-01T17:34:26.0703818Z 2024-11-01T17:34:26.0706066Z cpp/basic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.basic_1.1_22ad5cd5824a5b55_.log 2024-11-01T17:34:26.0707642Z 2024-11-01T17:34:26.0708872Z Running cpp/broadcast_test 1/1 ... [2024-11-01 17:34:26.069859] 2024-11-01T17:34:26.0710234Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:26.0713485Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/broadcast_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-686e5ab2fe43e275.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:26.070477] 2024-11-01T17:34:29.4449569Z 2024-11-01T17:34:29.4452346Z cpp/broadcast_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.broadcast_test_1.1_314273e78ebd70ef_.log 2024-11-01T17:34:29.4453965Z 2024-11-01T17:34:29.4454669Z Running cpp/cpu_generator_test 1/1 ... [2024-11-01 17:34:29.444437] 2024-11-01T17:34:29.4455937Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:29.4459788Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/cpu_generator_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-9b3f9bc2fdebbc66.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:29.445048] 2024-11-01T17:34:32.8197575Z 2024-11-01T17:34:32.8200243Z cpp/cpu_generator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cpu_generator_test_1.1_9b99ae2bc5d01b88_.log 2024-11-01T17:34:32.8201809Z 2024-11-01T17:34:32.8202354Z Running cpp/dlconvertor_test 1/1 ... 
[2024-11-01 17:34:32.819286] 2024-11-01T17:34:32.8203272Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:32.8206073Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/dlconvertor_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-c3016eaed58edf92.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:32.819937] 2024-11-01T17:34:36.1945681Z 2024-11-01T17:34:36.1947469Z cpp/dlconvertor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.dlconvertor_test_1.1_48ccef79146b67b8_.log 2024-11-01T17:34:36.1948359Z 2024-11-01T17:34:36.1948780Z Running cpp/extension_backend_test 1/1 ... [2024-11-01 17:34:36.194110] 2024-11-01T17:34:36.1949400Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:36.1953279Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/extension_backend_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-e7dbffd2377bd90d.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:36.194761] 2024-11-01T17:34:39.5193109Z 2024-11-01T17:34:39.5196110Z cpp/extension_backend_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.extension_backend_test_1.1_29c9ab074c995f28_.log 2024-11-01T17:34:39.5197912Z 2024-11-01T17:34:39.5198598Z Running cpp/lazy_tensor_test 1/1 ... [2024-11-01 17:34:39.518820] 2024-11-01T17:34:39.5199668Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:39.5203481Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/lazy_tensor_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-ce7de03a5970f0d0.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:34:39.519492] 2024-11-01T17:34:42.8935188Z 2024-11-01T17:34:42.8938542Z cpp/lazy_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.lazy_tensor_test_1.1_c4bcf351dfc13224_.log 2024-11-01T17:34:42.8940284Z 2024-11-01T17:34:42.8940989Z Running cpp/legacy_vmap_test 1/1 ... [2024-11-01 17:34:42.893108] 2024-11-01T17:34:42.8942081Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:42.8945233Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/legacy_vmap_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-8a7081d8ba6636df.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:42.893652] 2024-11-01T17:34:46.2679693Z 2024-11-01T17:34:46.2682987Z cpp/legacy_vmap_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.legacy_vmap_test_1.1_37db96c97cae8fd0_.log 2024-11-01T17:34:46.2684833Z 2024-11-01T17:34:46.2685480Z Running cpp/native_test 1/1 ... [2024-11-01 17:34:46.267367] 2024-11-01T17:34:46.2686549Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:46.2689696Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/native_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-f65d1e3629212bd8.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:46.268030] 2024-11-01T17:34:49.6427714Z 2024-11-01T17:34:49.6430865Z cpp/native_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.native_test_1.1_adf003dd1ea4cdf6_.log 2024-11-01T17:34:49.6432479Z 2024-11-01T17:34:49.6433122Z Running cpp/operators_test 1/1 ... 
[2024-11-01 17:34:49.642158] 2024-11-01T17:34:49.6434173Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:49.6437428Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/operators_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-37e19b03c5bd2326.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:49.642855] 2024-11-01T17:34:53.0683590Z 2024-11-01T17:34:53.0685737Z cpp/operators_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.operators_test_1.1_89d8ce02eae031d8_.log 2024-11-01T17:34:53.0687398Z 2024-11-01T17:34:53.0688108Z Running cpp/scalar_tensor_test 1/1 ... [2024-11-01 17:34:53.067951] 2024-11-01T17:34:53.0689211Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:53.0693180Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/scalar_tensor_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-740e7e70013be636.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:53.068622] 2024-11-01T17:34:56.4427688Z 2024-11-01T17:34:56.4430802Z cpp/scalar_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.scalar_tensor_test_1.1_77080e168f4943a0_.log 2024-11-01T17:34:56.4432509Z 2024-11-01T17:34:56.4433134Z Running cpp/scalar_test 1/1 ... [2024-11-01 17:34:56.442301] 2024-11-01T17:34:56.4434151Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:56.4437368Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/scalar_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-9cd515492c9c5e0d.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:34:56.442913] 2024-11-01T17:34:59.8175430Z 2024-11-01T17:34:59.8178473Z cpp/scalar_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.scalar_test_1.1_fbba941edbe68816_.log 2024-11-01T17:34:59.8180110Z 2024-11-01T17:34:59.8180800Z Running cpp/tensor_iterator_test 1/1 ... [2024-11-01 17:34:59.817072] 2024-11-01T17:34:59.8181949Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:34:59.8189383Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/tensor_iterator_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-d3d1cda011d1fe98.xml', '-x', '--reruns=2'] ... [2024-11-01 17:34:59.817701] 2024-11-01T17:35:03.1919609Z 2024-11-01T17:35:03.1922348Z cpp/tensor_iterator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.tensor_iterator_test_1.1_95db5bc415035ac9_.log 2024-11-01T17:35:03.1924043Z 2024-11-01T17:35:03.1924766Z Running cpp/undefined_tensor_test 1/1 ... [2024-11-01 17:35:03.191569] 2024-11-01T17:35:03.1925911Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:03.1929749Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/undefined_tensor_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-ee4f6736b652a9c3.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:03.192197] 2024-11-01T17:35:06.6164706Z 2024-11-01T17:35:06.6167479Z cpp/undefined_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.undefined_tensor_test_1.1_65754cead3ce64a0_.log 2024-11-01T17:35:06.6169310Z 2024-11-01T17:35:06.6169935Z Running cpp/wrapdim_test 1/1 ... 
[2024-11-01 17:35:06.615952] 2024-11-01T17:35:06.6171000Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:06.6174226Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/wrapdim_test', '-m', 'serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-1d6f6c031bf63b1b.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:06.616556] 2024-11-01T17:35:09.9910920Z 2024-11-01T17:35:09.9913673Z cpp/wrapdim_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.wrapdim_test_1.1_444a50b500eb51ee_.log 2024-11-01T17:35:09.9915250Z 2024-11-01T17:35:09.9928752Z Running cpp/Dict_test 1/1 ... [2024-11-01 17:35:09.992288] 2024-11-01T17:35:09.9929912Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:09.9931100Z Running cpp/Dimname_test 1/1 ... [2024-11-01 17:35:09.992422] 2024-11-01T17:35:09.9932098Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:09.9940457Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/Dict_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-a4b9711533262c54.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:09.993228] 2024-11-01T17:35:09.9945615Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/Dimname_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-fe502d5bfa82fd83.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:09.993306] 2024-11-01T17:35:15.9722322Z 2024-11-01T17:35:15.9724006Z cpp/Dimname_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dimname_test_1.1_0d62496a44a46d4f_.log 2024-11-01T17:35:15.9724953Z 2024-11-01T17:35:15.9726528Z Running cpp/NamedTensor_test 1/1 ... 
[2024-11-01 17:35:15.972169] 2024-11-01T17:35:15.9727134Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:15.9733010Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/NamedTensor_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-28e025cd14263b32.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:15.972701] 2024-11-01T17:35:25.8099242Z 2024-11-01T17:35:25.8102610Z cpp/NamedTensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.NamedTensor_test_1.1_e027dda8507f8cf0_.log 2024-11-01T17:35:25.8104299Z 2024-11-01T17:35:25.8104951Z Running cpp/apply_utils_test 1/1 ... [2024-11-01 17:35:25.809542] 2024-11-01T17:35:25.8106044Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:25.8109787Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/apply_utils_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-7f05120ebf1745d0.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:25.810078] 2024-11-01T17:35:33.1929619Z 2024-11-01T17:35:33.1932908Z cpp/apply_utils_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.apply_utils_test_1.1_5c1353a82cb5be55_.log 2024-11-01T17:35:33.1933868Z 2024-11-01T17:35:33.1934204Z Running cpp/atest 1/1 ... [2024-11-01 17:35:33.192361] 2024-11-01T17:35:33.1934729Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:33.1936329Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/atest', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-0dae02e0580f75d5.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:35:33.192746] 2024-11-01T17:35:44.7356873Z 2024-11-01T17:35:44.7360577Z cpp/Dict_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.Dict_test_1.1_85d40ddb197d3778_.log 2024-11-01T17:35:44.7362262Z 2024-11-01T17:35:44.7362849Z Running cpp/basic 1/1 ... [2024-11-01 17:35:44.735263] 2024-11-01T17:35:44.7363819Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:44.7371076Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/basic', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-c1a1dab83957eee8.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:44.735741] 2024-11-01T17:35:47.0901032Z 2024-11-01T17:35:47.0903791Z cpp/atest 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.atest_1.1_ebd124bdc5edfae1_.log 2024-11-01T17:35:47.0905328Z 2024-11-01T17:35:47.0906004Z Running cpp/broadcast_test 1/1 ... [2024-11-01 17:35:47.089564] 2024-11-01T17:35:47.0907103Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:47.0910481Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/broadcast_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-0dfe64bfc463fcf6.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:47.089970] 2024-11-01T17:35:51.8161027Z 2024-11-01T17:35:51.8164017Z cpp/broadcast_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.broadcast_test_1.1_a32bdb3f5003ff78_.log 2024-11-01T17:35:51.8165725Z 2024-11-01T17:35:51.8166439Z Running cpp/cpu_generator_test 1/1 ... 
[2024-11-01 17:35:51.815745] 2024-11-01T17:35:51.8167527Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:51.8170754Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/cpu_generator_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-bc5c8c0bf76df515.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:51.816370] 2024-11-01T17:35:52.2194903Z 2024-11-01T17:35:52.2197298Z cpp/basic 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.basic_1.1_10355593887af7ba_.log 2024-11-01T17:35:52.2198811Z 2024-11-01T17:35:52.2199468Z Running cpp/dlconvertor_test 1/1 ... [2024-11-01 17:35:52.219061] 2024-11-01T17:35:52.2200557Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:52.2204531Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/dlconvertor_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-96d1e178905e78a1.xml', '-x', '--reruns=2'] ... [2024-11-01 17:35:52.219424] 2024-11-01T17:35:56.8455852Z 2024-11-01T17:35:56.8458851Z cpp/dlconvertor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.dlconvertor_test_1.1_7a183ac486720384_.log 2024-11-01T17:35:56.8460569Z 2024-11-01T17:35:56.8461672Z Running cpp/extension_backend_test 1/1 ... [2024-11-01 17:35:56.844991] 2024-11-01T17:35:56.8463744Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:35:56.8467339Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/extension_backend_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-43277b2e2b0ed6f9.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:35:56.845370] 2024-11-01T17:36:01.5221694Z 2024-11-01T17:36:01.5224566Z cpp/extension_backend_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.extension_backend_test_1.1_74fad863734276e4_.log 2024-11-01T17:36:01.5226382Z 2024-11-01T17:36:01.5227059Z Running cpp/lazy_tensor_test 1/1 ... [2024-11-01 17:36:01.521050] 2024-11-01T17:36:01.5228133Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:01.5231942Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/lazy_tensor_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-b9cad50080d66f73.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:01.521439] 2024-11-01T17:36:05.6640686Z 2024-11-01T17:36:05.6643558Z cpp/cpu_generator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.cpu_generator_test_1.1_b6bed20c6ac7c361_.log 2024-11-01T17:36:05.6645294Z 2024-11-01T17:36:05.6645994Z Running cpp/legacy_vmap_test 1/1 ... [2024-11-01 17:36:05.663815] 2024-11-01T17:36:05.6647125Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:05.6652760Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/legacy_vmap_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-86f3c7eb29c239b9.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:05.664414] 2024-11-01T17:36:06.1985653Z 2024-11-01T17:36:06.1987640Z cpp/lazy_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.lazy_tensor_test_1.1_3e8a88df314e248a_.log 2024-11-01T17:36:06.1989286Z 2024-11-01T17:36:06.1989910Z Running cpp/native_test 1/1 ... 
[2024-11-01 17:36:06.198282] 2024-11-01T17:36:06.1991008Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:06.1994192Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/native_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-b36a9e5f09a7d68f.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:06.198650] 2024-11-01T17:36:17.0877565Z 2024-11-01T17:36:17.0880782Z cpp/native_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.native_test_1.1_11a3d20090b648d4_.log 2024-11-01T17:36:17.0882385Z 2024-11-01T17:36:17.0883062Z Running cpp/operators_test 1/1 ... [2024-11-01 17:36:17.087253] 2024-11-01T17:36:17.0884126Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:17.0887433Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/operators_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-50e70d45bd822c66.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:17.087888] 2024-11-01T17:36:23.1171074Z 2024-11-01T17:36:23.1173263Z cpp/operators_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.operators_test_1.1_f39551f2a1f8925f_.log 2024-11-01T17:36:23.1175435Z 2024-11-01T17:36:23.1186114Z Running cpp/scalar_tensor_test 1/1 ... [2024-11-01 17:36:23.116837] 2024-11-01T17:36:23.1187367Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:23.1189170Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/scalar_tensor_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-437e5bb53848257b.xml', '-x', '--reruns=2'] ... 
[2024-11-01 17:36:23.117504] 2024-11-01T17:36:25.0725608Z 2024-11-01T17:36:25.0728886Z cpp/legacy_vmap_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.legacy_vmap_test_1.1_fabe78d3111d9f7c_.log 2024-11-01T17:36:25.0729984Z 2024-11-01T17:36:25.0730310Z Running cpp/scalar_test 1/1 ... [2024-11-01 17:36:25.071846] 2024-11-01T17:36:25.0730873Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:25.0732487Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/scalar_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-3448245415ae3f97.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:25.072194] 2024-11-01T17:36:29.3484210Z 2024-11-01T17:36:29.3486651Z cpp/scalar_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.scalar_tensor_test_1.1_8ec693fbb941f5a6_.log 2024-11-01T17:36:29.3488018Z 2024-11-01T17:36:29.3488572Z Running cpp/tensor_iterator_test 1/1 ... [2024-11-01 17:36:29.347685] 2024-11-01T17:36:29.3489447Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:29.3493129Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/tensor_iterator_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-8caefa36695502dd.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:29.348365] 2024-11-01T17:36:31.3540240Z 2024-11-01T17:36:31.3541967Z cpp/scalar_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.scalar_test_1.1_2289b717f14fcede_.log 2024-11-01T17:36:31.3542843Z 2024-11-01T17:36:31.3543240Z Running cpp/undefined_tensor_test 1/1 ... 
[2024-11-01 17:36:31.353611] 2024-11-01T17:36:31.3543831Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:31.3554212Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/undefined_tensor_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-89d6223736c81be8.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:31.354032] 2024-11-01T17:36:36.0304257Z 2024-11-01T17:36:36.0306711Z cpp/undefined_tensor_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.undefined_tensor_test_1.1_84449875746b0cea_.log 2024-11-01T17:36:36.0307672Z 2024-11-01T17:36:36.0308003Z Running cpp/wrapdim_test 1/1 ... [2024-11-01 17:36:36.029351] 2024-11-01T17:36:36.0308538Z SCRIBE_GRAPHQL_ACCESS_TOKEN is NOT set 2024-11-01T17:36:36.0310231Z Executing ['pytest', '/var/lib/jenkins/pytorch/build/bin/wrapdim_test', '-m', 'not serial', '-v', '-vv', '-rfEX', '-n', '2', '--junit-xml-reruns', 'test-reports/python-pytest/test.run_test/test.run_test-ec4f0812caceba01.xml', '-x', '--reruns=2'] ... [2024-11-01 17:36:36.029719] 2024-11-01T17:36:40.7059290Z 2024-11-01T17:36:40.7061600Z cpp/wrapdim_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.wrapdim_test_1.1_2da31c82d73c0c05_.log 2024-11-01T17:36:40.7062972Z 2024-11-01T17:37:17.0742119Z 2024-11-01T17:37:17.0744788Z cpp/tensor_iterator_test 1/1 was successful, full logs can be found in artifacts with path test/test-reports/cpp.tensor_iterator_test_1.1_1eae64d99fca0db9_.log 2024-11-01T17:37:17.0746563Z 2024-11-01T17:37:17.9336586Z Running test batch 'tests to run' cost 192.77 seconds 2024-11-01T17:37:18.7775072Z + run_if_exists tensor_interop_test 2024-11-01T17:37:18.7776023Z + local test_name=tensor_interop_test 2024-11-01T17:37:18.7777205Z + [[ -x build/bin/tensor_interop_test ]] 2024-11-01T17:37:18.7779278Z + echo 'Warning: tensor_interop_test does not exist.' 
2024-11-01T17:37:18.7780281Z Warning: tensor_interop_test does not exist. 2024-11-01T17:37:18.7781326Z + run_if_exists cudnn_test 2024-11-01T17:37:18.7782032Z + local test_name=cudnn_test 2024-11-01T17:37:18.7782831Z + [[ -x build/bin/cudnn_test ]] 2024-11-01T17:37:18.7783748Z + echo 'Warning: cudnn_test does not exist.' 2024-11-01T17:37:18.7784606Z Warning: cudnn_test does not exist. 2024-11-01T17:37:18.7790979Z + run_if_exists cuda_generator_test 2024-11-01T17:37:18.7792100Z + local test_name=cuda_generator_test 2024-11-01T17:37:18.7793491Z + [[ -x build/bin/cuda_generator_test ]] 2024-11-01T17:37:18.7794224Z + echo 'Warning: cuda_generator_test does not exist.' 2024-11-01T17:37:18.7794772Z Warning: cuda_generator_test does not exist. 2024-11-01T17:37:18.7795242Z + run_if_exists apply_test 2024-11-01T17:37:18.7795601Z + local test_name=apply_test 2024-11-01T17:37:18.7796022Z + [[ -x build/bin/apply_test ]] 2024-11-01T17:37:18.7796505Z + echo 'Warning: apply_test does not exist.' 2024-11-01T17:37:18.7796982Z Warning: apply_test does not exist. 2024-11-01T17:37:18.7797394Z + run_if_exists stream_test 2024-11-01T17:37:18.7797763Z + local test_name=stream_test 2024-11-01T17:37:18.7798175Z + [[ -x build/bin/stream_test ]] 2024-11-01T17:37:18.7798647Z + echo 'Warning: stream_test does not exist.' 2024-11-01T17:37:18.7799107Z Warning: stream_test does not exist. 2024-11-01T17:37:18.7799533Z + run_if_exists cuda_half_test 2024-11-01T17:37:18.7799916Z + local test_name=cuda_half_test 2024-11-01T17:37:18.7800488Z + [[ -x build/bin/cuda_half_test ]] 2024-11-01T17:37:18.7800971Z + echo 'Warning: cuda_half_test does not exist.' 2024-11-01T17:37:18.7801454Z Warning: cuda_half_test does not exist. 
2024-11-01T17:37:18.7801917Z + run_if_exists cuda_vectorized_test 2024-11-01T17:37:18.7802334Z + local test_name=cuda_vectorized_test 2024-11-01T17:37:18.7802802Z + [[ -x build/bin/cuda_vectorized_test ]] 2024-11-01T17:37:18.7803351Z + echo 'Warning: cuda_vectorized_test does not exist.' 2024-11-01T17:37:18.7803871Z Warning: cuda_vectorized_test does not exist. 2024-11-01T17:37:18.7804335Z + run_if_exists cuda_distributions_test 2024-11-01T17:37:18.7804770Z + local test_name=cuda_distributions_test 2024-11-01T17:37:18.7805267Z + [[ -x build/bin/cuda_distributions_test ]] 2024-11-01T17:37:18.7805830Z + echo 'Warning: cuda_distributions_test does not exist.' 2024-11-01T17:37:18.7806384Z Warning: cuda_distributions_test does not exist. 2024-11-01T17:37:18.7806852Z + run_if_exists cuda_optional_test 2024-11-01T17:37:18.7807259Z + local test_name=cuda_optional_test 2024-11-01T17:37:18.7807727Z + [[ -x build/bin/cuda_optional_test ]] 2024-11-01T17:37:18.7808252Z + echo 'Warning: cuda_optional_test does not exist.' 2024-11-01T17:37:18.7808764Z Warning: cuda_optional_test does not exist. 2024-11-01T17:37:18.7809222Z + run_if_exists cuda_tensor_interop_test 2024-11-01T17:37:18.7809677Z + local test_name=cuda_tensor_interop_test 2024-11-01T17:37:18.7810178Z + [[ -x build/bin/cuda_tensor_interop_test ]] 2024-11-01T17:37:18.7810755Z + echo 'Warning: cuda_tensor_interop_test does not exist.' 2024-11-01T17:37:18.7811319Z Warning: cuda_tensor_interop_test does not exist. 2024-11-01T17:37:18.7811782Z + run_if_exists cuda_complex_test 2024-11-01T17:37:18.7812189Z + local test_name=cuda_complex_test 2024-11-01T17:37:18.7812639Z + [[ -x build/bin/cuda_complex_test ]] 2024-11-01T17:37:18.7813150Z + echo 'Warning: cuda_complex_test does not exist.' 2024-11-01T17:37:18.7813651Z Warning: cuda_complex_test does not exist. 
2024-11-01T17:37:18.7814105Z + run_if_exists cuda_complex_math_test 2024-11-01T17:37:18.7814531Z + local test_name=cuda_complex_math_test 2024-11-01T17:37:18.7815023Z + [[ -x build/bin/cuda_complex_math_test ]] 2024-11-01T17:37:18.7815581Z + echo 'Warning: cuda_complex_math_test does not exist.' 2024-11-01T17:37:18.7816112Z Warning: cuda_complex_math_test does not exist. 2024-11-01T17:37:18.7816565Z + run_if_exists cuda_cub_test 2024-11-01T17:37:18.7817052Z + local test_name=cuda_cub_test 2024-11-01T17:37:18.7817483Z + [[ -x build/bin/cuda_cub_test ]] 2024-11-01T17:37:18.7817956Z + echo 'Warning: cuda_cub_test does not exist.' 2024-11-01T17:37:18.7818586Z Warning: cuda_cub_test does not exist. 2024-11-01T17:37:18.7819016Z + run_if_exists cuda_atomic_ops_test 2024-11-01T17:37:18.7819442Z + local test_name=cuda_atomic_ops_test 2024-11-01T17:37:18.7819920Z + [[ -x build/bin/cuda_atomic_ops_test ]] 2024-11-01T17:37:18.7820459Z + echo 'Warning: cuda_atomic_ops_test does not exist.' 2024-11-01T17:37:18.7820985Z Warning: cuda_atomic_ops_test does not exist. 
2024-11-01T17:37:18.7821543Z + '[' OFF == ON ']' 2024-11-01T17:37:18.7821945Z + [[ -n '' ]] 2024-11-01T17:37:18.7822247Z + assert_git_not_dirty 2024-11-01T17:37:18.7822697Z + [[ linux-focal-rocm6.2-py3.10 != *rocm* ]] 2024-11-01T17:37:18.7957341Z ##[group]Run # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-11-01T17:37:18.7958671Z # copy test results back to the mounted workspace, needed sudo, resulting permissions were correct 2024-11-01T17:37:18.7959971Z docker exec -t "ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815" sh -c "cd ../pytorch && sudo cp -R test/test-reports ../workspace/test" 2024-11-01T17:37:18.7993001Z shell: /usr/bin/bash -e {0} 2024-11-01T17:37:18.7993374Z env: 2024-11-01T17:37:18.7993668Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:18.7994094Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:18.7994832Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:18.7995523Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:18.7995915Z AWS_REGION: us-east-1 2024-11-01T17:37:18.7996332Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:18.7996899Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:18.8009253Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:18.8010308Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:18.8011462Z ##[endgroup] 2024-11-01T17:37:18.8536005Z sudo: setrlimit(RLIMIT_STACK): Operation not permitted 2024-11-01T17:37:18.8973419Z ##[group]Run cat test/**/*_toprint.log || true 2024-11-01T17:37:18.8974190Z cat test/**/*_toprint.log || true 2024-11-01T17:37:18.9000615Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T17:37:18.9001376Z env: 2024-11-01T17:37:18.9001811Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:18.9002432Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:18.9003525Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd 
--device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:18.9004556Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:18.9005114Z AWS_REGION: us-east-1 2024-11-01T17:37:18.9005748Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:18.9006477Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:18.9015977Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:18.9016810Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:18.9017771Z ##[endgroup] 2024-11-01T17:37:18.9180055Z cat: 'test/**/*_toprint.log': No such file or directory 2024-11-01T17:37:18.9236159Z ##[group]Run kill "$MONITOR_SCRIPT_PID" 2024-11-01T17:37:18.9236660Z kill "$MONITOR_SCRIPT_PID" 2024-11-01T17:37:18.9270011Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T17:37:18.9270517Z env: 2024-11-01T17:37:18.9270820Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:18.9271237Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:18.9271975Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:18.9272680Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:18.9273053Z AWS_REGION: us-east-1 2024-11-01T17:37:18.9273503Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:18.9274010Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:18.9282777Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:18.9283836Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:18.9285303Z MONITOR_SCRIPT_PID: 2661005 2024-11-01T17:37:18.9285984Z ##[endgroup] 2024-11-01T17:37:18.9621198Z Prepare all required actions 2024-11-01T17:37:18.9622106Z Getting action download info 2024-11-01T17:37:19.6132684Z Download action repository 'seemethere/upload-artifact-s3@v5' (SHA:baba72d0712b404f646cebe0730933554ebce96a) 2024-11-01T17:37:20.2157446Z ##[group]Run ./.github/actions/upload-test-artifacts 2024-11-01T17:37:20.2157896Z with: 2024-11-01T17:37:20.2158165Z use-gha: true 2024-11-01T17:37:20.2158572Z 
file-suffix: test-default-1-2-linux.rocm.gpu_32397140314 2024-11-01T17:37:20.2159207Z s3-bucket: gha-artifacts 2024-11-01T17:37:20.2159528Z env: 2024-11-01T17:37:20.2159784Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:20.2160115Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:20.2160707Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:20.2161269Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:20.2161576Z AWS_REGION: us-east-1 2024-11-01T17:37:20.2161951Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:20.2162353Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:20.2167297Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:20.2167746Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:20.2168319Z ##[endgroup] 2024-11-01T17:37:20.2238262Z ##[group]Run actions/upload-artifact@v3 2024-11-01T17:37:20.2238602Z with: 2024-11-01T17:37:20.2239014Z name: test-jsons-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip 2024-11-01T17:37:20.2239516Z retention-days: 14 2024-11-01T17:37:20.2239804Z if-no-files-found: warn 2024-11-01T17:37:20.2240089Z path: test/**/*.json 2024-11-01T17:37:20.2240372Z include-hidden-files: false 2024-11-01T17:37:20.2240661Z env: 2024-11-01T17:37:20.2240892Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:20.2241224Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:20.2241806Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:20.2242349Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:20.2242653Z AWS_REGION: us-east-1 2024-11-01T17:37:20.2242979Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:20.2243368Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:20.2248330Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:20.2248784Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:20.2249262Z ##[endgroup] 2024-11-01T17:37:20.6697072Z With the 
provided path, there will be 7 files uploaded 2024-11-01T17:37:20.6699215Z Starting artifact upload 2024-11-01T17:37:20.6701889Z For more detailed logs during the artifact upload process, enable step-debugging: https://docs.github.com/actions/monitoring-and-troubleshooting-workflows/enabling-debug-logging#enabling-step-debug-logging 2024-11-01T17:37:20.6704259Z Artifact name is valid! 2024-11-01T17:37:21.0125695Z Container for artifact "test-jsons-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip" successfully created. Starting upload of file(s) 2024-11-01T17:37:21.6601688Z Total size of all the files uploaded is 40937 bytes 2024-11-01T17:37:21.6602974Z File upload process has finished. Finalizing the artifact upload 2024-11-01T17:37:21.7454942Z Artifact has been finalized. All files have been successfully uploaded! 2024-11-01T17:37:21.7455841Z 2024-11-01T17:37:21.7456475Z The raw size of all the files that were specified for upload is 360512 bytes 2024-11-01T17:37:21.7458907Z The size of all the files that were uploaded is 40937 bytes. This takes into account any gzip compression used to reduce the upload size, time and storage 2024-11-01T17:37:21.7460502Z 2024-11-01T17:37:21.7462318Z Note: The size of downloaded zips can differ significantly from the reported size. For more information see: https://github.com/actions/upload-artifact#zipped-artifact-downloads 2024-11-01T17:37:21.7464514Z 2024-11-01T17:37:21.7465730Z Artifact test-jsons-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip has been successfully uploaded! 
2024-11-01T17:37:21.7777272Z ##[group]Run actions/upload-artifact@v3 2024-11-01T17:37:21.7778271Z with: 2024-11-01T17:37:21.7779284Z name: test-reports-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip 2024-11-01T17:37:21.7780509Z retention-days: 14 2024-11-01T17:37:21.7781157Z if-no-files-found: ignore 2024-11-01T17:37:21.7781876Z path: test/**/*.xml test/**/*.csv 2024-11-01T17:37:21.7782671Z include-hidden-files: false 2024-11-01T17:37:21.7783344Z env: 2024-11-01T17:37:21.7783872Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:21.7784885Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:21.7786248Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:21.7787530Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:21.7788240Z AWS_REGION: us-east-1 2024-11-01T17:37:21.7789040Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:21.7789962Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:21.7802093Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:21.7803153Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:21.7804284Z ##[endgroup] 2024-11-01T17:37:22.3049727Z With the provided path, there will be 58 files uploaded 2024-11-01T17:37:22.3051677Z Starting artifact upload 2024-11-01T17:37:22.3054582Z For more detailed logs during the artifact upload process, enable step-debugging: https://docs.github.com/actions/monitoring-and-troubleshooting-workflows/enabling-debug-logging#enabling-step-debug-logging 2024-11-01T17:37:22.3056981Z Artifact name is valid! 2024-11-01T17:37:22.6503445Z Container for artifact "test-reports-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip" successfully created. Starting upload of file(s) 2024-11-01T17:37:26.7299696Z Total size of all the files uploaded is 450386 bytes 2024-11-01T17:37:26.7300965Z File upload process has finished. Finalizing the artifact upload 2024-11-01T17:37:26.8102951Z Artifact has been finalized. 
All files have been successfully uploaded! 2024-11-01T17:37:26.8103912Z 2024-11-01T17:37:26.8104617Z The raw size of all the files that were specified for upload is 12046372 bytes 2024-11-01T17:37:26.8106831Z The size of all the files that were uploaded is 450386 bytes. This takes into account any gzip compression used to reduce the upload size, time and storage 2024-11-01T17:37:26.8108396Z 2024-11-01T17:37:26.8110247Z Note: The size of downloaded zips can differ significantly from the reported size. For more information see: https://github.com/actions/upload-artifact#zipped-artifact-downloads 2024-11-01T17:37:26.8112064Z 2024-11-01T17:37:26.8113308Z Artifact test-reports-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip has been successfully uploaded! 2024-11-01T17:37:26.8348880Z ##[group]Run actions/upload-artifact@v3 2024-11-01T17:37:26.8349680Z with: 2024-11-01T17:37:26.8350674Z name: logs-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip 2024-11-01T17:37:26.8352051Z retention-days: 14 2024-11-01T17:37:26.8352812Z if-no-files-found: ignore 2024-11-01T17:37:26.8353687Z path: usage_log.txt test/**/*.log 2024-11-01T17:37:26.8354615Z include-hidden-files: false 2024-11-01T17:37:26.8355405Z env: 2024-11-01T17:37:26.8355882Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:26.8356361Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:26.8357089Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:26.8357766Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:26.8358149Z AWS_REGION: us-east-1 2024-11-01T17:37:26.8358589Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:26.8359076Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:26.8369225Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:26.8370275Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:26.8371401Z ##[endgroup] 2024-11-01T17:37:27.3662237Z Multiple search paths detected. 
Calculating the least common ancestor of all paths 2024-11-01T17:37:27.3666164Z The least common ancestor is /home/pytorchci/actions-runner/_work/pytorch/pytorch. This will be the root directory of the artifact 2024-11-01T17:37:27.3668132Z With the provided path, there will be 57 files uploaded 2024-11-01T17:37:27.3669561Z Starting artifact upload 2024-11-01T17:37:27.3672187Z For more detailed logs during the artifact upload process, enable step-debugging: https://docs.github.com/actions/monitoring-and-troubleshooting-workflows/enabling-debug-logging#enabling-step-debug-logging 2024-11-01T17:37:27.3674604Z Artifact name is valid! 2024-11-01T17:37:27.7108674Z Container for artifact "logs-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip" successfully created. Starting upload of file(s) 2024-11-01T17:37:32.1066541Z Total size of all the files uploaded is 878091 bytes 2024-11-01T17:37:32.1067852Z File upload process has finished. Finalizing the artifact upload 2024-11-01T17:37:32.1996046Z Artifact has been finalized. All files have been successfully uploaded! 2024-11-01T17:37:32.1996983Z 2024-11-01T17:37:32.1997712Z The raw size of all the files that were specified for upload is 20440906 bytes 2024-11-01T17:37:32.1999921Z The size of all the files that were uploaded is 878091 bytes. This takes into account any gzip compression used to reduce the upload size, time and storage 2024-11-01T17:37:32.2001487Z 2024-11-01T17:37:32.2003332Z Note: The size of downloaded zips can differ significantly from the reported size. For more information see: https://github.com/actions/upload-artifact#zipped-artifact-downloads 2024-11-01T17:37:32.2005154Z 2024-11-01T17:37:32.2006280Z Artifact logs-runattempt1-test-default-1-2-linux.rocm.gpu_32397140314.zip has been successfully uploaded! 2024-11-01T17:37:32.2270505Z ##[group]Run # shellcheck disable=SC2156 2024-11-01T17:37:32.2271456Z # shellcheck disable=SC2156 2024-11-01T17:37:32.2272909Z find . 
-iname "core.[1-9]*" -exec docker exec "${CONTAINER_NAME}" sh -c "gdb python {} -ex 'bt' -ex 'q'" \; 2024-11-01T17:37:32.2308351Z shell: /usr/bin/bash -e {0} 2024-11-01T17:37:32.2309032Z env: 2024-11-01T17:37:32.2309567Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:32.2310347Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:32.2311738Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:32.2313077Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:32.2313796Z AWS_REGION: us-east-1 2024-11-01T17:37:32.2314591Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:32.2315519Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:32.2327721Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:32.2328827Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:32.2329973Z ##[endgroup] 2024-11-01T17:37:32.5335176Z Prepare all required actions 2024-11-01T17:37:32.5336080Z Getting action download info 2024-11-01T17:37:32.5378387Z ##[group]Run ./.github/actions/teardown-rocm 2024-11-01T17:37:32.5378828Z env: 2024-11-01T17:37:32.5379125Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:32.5379528Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:32.5380251Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:32.5380923Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:32.5381302Z AWS_REGION: us-east-1 2024-11-01T17:37:32.5381723Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:32.5382215Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:32.5388446Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:32.5389005Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:32.5389606Z ##[endgroup] 2024-11-01T17:37:32.5419763Z ##[group]Run # ignore expansion of "docker ps -q" since it could be empty 2024-11-01T17:37:32.5421144Z # ignore expansion of "docker ps -q" since it could be empty 
2024-11-01T17:37:32.5422450Z # shellcheck disable=SC2046 2024-11-01T17:37:32.5423316Z docker stop $(docker ps -q) || true 2024-11-01T17:37:32.5424187Z # Prune all stopped containers. 2024-11-01T17:37:32.5425104Z docker container prune -f 2024-11-01T17:37:32.5457306Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T17:37:32.5458427Z env: 2024-11-01T17:37:32.5458995Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:32.5459770Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:32.5461153Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:32.5462444Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:32.5463448Z AWS_REGION: us-east-1 2024-11-01T17:37:32.5464214Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:32.5465255Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:32.5474234Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:32.5474783Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:32.5475379Z ##[endgroup] 2024-11-01T17:37:32.9784576Z ff4898059595 2024-11-01T17:37:50.8262910Z Deleted Containers: 2024-11-01T17:37:50.8263885Z ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:50.8264628Z 2024-11-01T17:37:50.8264874Z Total reclaimed space: 8.275GB 2024-11-01T17:37:50.8354040Z Prepare all required actions 2024-11-01T17:37:50.8413044Z ##[group]Run ./.github/actions/diskspace-cleanup 2024-11-01T17:37:50.8413872Z with: 2024-11-01T17:37:50.8414416Z diskspace-cutoff: 70 2024-11-01T17:37:50.8415017Z env: 2024-11-01T17:37:50.8415538Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:50.8416295Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:50.8417667Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:50.8419219Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:50.8419947Z AWS_REGION: us-east-1 2024-11-01T17:37:50.8420718Z AWS_ACCESS_KEY_ID: *** 
2024-11-01T17:37:50.8421678Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:50.8434288Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:50.8435542Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:50.8436829Z ##[endgroup] 2024-11-01T17:37:50.8466806Z ##[group]Run set -ex 2024-11-01T17:37:50.8467459Z set -ex 2024-11-01T17:37:50.8468080Z diskspace_cutoff=70 2024-11-01T17:37:50.8469025Z docker_root_dir=$(docker info -f '{{.DockerRootDir}}') 2024-11-01T17:37:50.8470530Z diskspace=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-11-01T17:37:50.8473023Z msg="Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified" 2024-11-01T17:37:50.8475885Z if [[ "$diskspace" -ge "$diskspace_cutoff" ]] ; then 2024-11-01T17:37:50.8476927Z  docker system prune -af 2024-11-01T17:37:50.8477693Z  diskspace_new=$(df -H --output=pcent ${docker_root_dir} | sed -n 2p | sed 's/%//' | sed 's/ //') 2024-11-01T17:37:50.8478474Z  if [[ "$diskspace_new" -gt "$diskspace_cutoff" ]] ; then 2024-11-01T17:37:50.8479292Z  echo "Error: Available diskspace is less than $diskspace_cutoff percent. Not enough diskspace." 
2024-11-01T17:37:50.8479988Z  echo "$msg" 2024-11-01T17:37:50.8480343Z  exit 1 2024-11-01T17:37:50.8480669Z  else 2024-11-01T17:37:50.8481062Z  difference=$((diskspace - diskspace_new)) 2024-11-01T17:37:50.8481615Z  echo "Diskspace saved: $difference percent" 2024-11-01T17:37:50.8482079Z  fi 2024-11-01T17:37:50.8482372Z fi 2024-11-01T17:37:50.8516707Z shell: /usr/bin/bash --noprofile --norc -e -o pipefail {0} 2024-11-01T17:37:50.8517313Z env: 2024-11-01T17:37:50.8517625Z GIT_DEFAULT_BRANCH: main 2024-11-01T17:37:50.8518032Z DOCKER_HOST: unix:///run/user/1001/docker.sock 2024-11-01T17:37:50.8518874Z GPU_FLAG: --device=/dev/mem --device=/dev/kfd --device=/dev/dri --group-add video --group-add daemon 2024-11-01T17:37:50.8519556Z AWS_DEFAULT_REGION: us-east-1 2024-11-01T17:37:50.8519930Z AWS_REGION: us-east-1 2024-11-01T17:37:50.8520322Z AWS_ACCESS_KEY_ID: *** 2024-11-01T17:37:50.8520805Z AWS_SECRET_ACCESS_KEY: *** 2024-11-01T17:37:50.8531408Z AWS_SESSION_TOKEN: *** 2024-11-01T17:37:50.8532449Z CONTAINER_NAME: ff4898059595dda38338ed9a44cf8707301f5ef964805ebcc3be4fb492a5c815 2024-11-01T17:37:50.8533579Z ##[endgroup] 2024-11-01T17:37:50.8594411Z + diskspace_cutoff=70 2024-11-01T17:37:50.8601072Z ++ docker info -f '{{.DockerRootDir}}' 2024-11-01T17:37:50.9213327Z + docker_root_dir=/home/pytorchci/.local/share/docker 2024-11-01T17:37:50.9219625Z ++ df -H --output=pcent /home/pytorchci/.local/share/docker 2024-11-01T17:37:50.9220689Z ++ sed -n 2p 2024-11-01T17:37:50.9221214Z ++ sed s/%// 2024-11-01T17:37:50.9222530Z ++ sed 's/ //' 2024-11-01T17:37:50.9239766Z + diskspace=21 2024-11-01T17:37:50.9241668Z + msg='Please file an issue on pytorch/pytorch reporting the faulty runner. Include a link to the runner logs so the runner can be identified' 2024-11-01T17:37:50.9243457Z + [[ 21 -ge 70 ]] 2024-11-01T17:37:50.9333049Z Post job cleanup. 2024-11-01T17:37:50.9414049Z Post job cleanup. 
2024-11-01T17:37:50.9798794Z Logging out of registry 308535385114.dkr.ecr.us-east-1.amazonaws.com 2024-11-01T17:37:51.0150937Z Post job cleanup. 2024-11-01T17:37:51.1487917Z Post job cleanup. 2024-11-01T17:37:51.1607018Z Post job cleanup. 2024-11-01T17:37:51.1707445Z Post job cleanup. 2024-11-01T17:37:51.2442554Z [command]/usr/bin/git version 2024-11-01T17:37:51.2481337Z git version 2.34.1 2024-11-01T17:37:51.2529672Z Temporarily overriding HOME='/home/pytorchci/actions-runner/_work/_temp/6ec86270-a246-4beb-bf34-b6bbe5f497d3' before making global git config changes 2024-11-01T17:37:51.2532046Z Adding repository directory to the temporary git global config as a safe directory 2024-11-01T17:37:51.2534110Z [command]/usr/bin/git config --global --add safe.directory /home/pytorchci/actions-runner/_work/pytorch/pytorch 2024-11-01T17:37:51.2563762Z [command]/usr/bin/git config --local --name-only --get-regexp core\.sshCommand 2024-11-01T17:37:51.2597992Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'core\.sshCommand' && git config --local --unset-all 'core.sshCommand' || :" 2024-11-01T17:37:51.2920247Z Entering 'android/libs/fbjni' 2024-11-01T17:37:51.2967053Z Entering 'third_party/FP16' 2024-11-01T17:37:51.3014542Z Entering 'third_party/FXdiv' 2024-11-01T17:37:51.3063373Z Entering 'third_party/NNPACK' 2024-11-01T17:37:51.3102076Z Entering 'third_party/NVTX' 2024-11-01T17:37:51.3148381Z Entering 'third_party/VulkanMemoryAllocator' 2024-11-01T17:37:51.3206888Z Entering 'third_party/XNNPACK' 2024-11-01T17:37:51.3266925Z Entering 'third_party/benchmark' 2024-11-01T17:37:51.3307155Z Entering 'third_party/composable_kernel' 2024-11-01T17:37:51.3359795Z Entering 'third_party/cpp-httplib' 2024-11-01T17:37:51.3402845Z Entering 'third_party/cpuinfo' 2024-11-01T17:37:51.3450296Z Entering 'third_party/cudnn_frontend' 2024-11-01T17:37:51.3492414Z Entering 'third_party/cutlass' 2024-11-01T17:37:51.3541468Z Entering 
'third_party/eigen' 2024-11-01T17:37:51.3587740Z Entering 'third_party/fbgemm' 2024-11-01T17:37:51.3632149Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T17:37:51.3678420Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T17:37:51.3719933Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T17:37:51.3768680Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T17:37:51.3809743Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T17:37:51.3854427Z Entering 'third_party/flatbuffers' 2024-11-01T17:37:51.3905259Z Entering 'third_party/fmt' 2024-11-01T17:37:51.3945773Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T17:37:51.3985844Z Entering 'third_party/gloo' 2024-11-01T17:37:51.4032569Z Entering 'third_party/googletest' 2024-11-01T17:37:51.4079125Z Entering 'third_party/ideep' 2024-11-01T17:37:51.4123926Z Entering 'third_party/ideep/mkl-dnn' 2024-11-01T17:37:51.4174433Z Entering 'third_party/ittapi' 2024-11-01T17:37:51.4222593Z Entering 'third_party/kineto' 2024-11-01T17:37:51.4265549Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T17:37:51.4312408Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T17:37:51.4354162Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T17:37:51.4396909Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T17:37:51.4439042Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T17:37:51.4478208Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T17:37:51.4527602Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T17:37:51.4568774Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T17:37:51.4609032Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 
2024-11-01T17:37:51.4649055Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T17:37:51.4691178Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T17:37:51.4731172Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T17:37:51.4779763Z Entering 'third_party/mimalloc' 2024-11-01T17:37:51.4824067Z Entering 'third_party/nccl/nccl' 2024-11-01T17:37:51.4873783Z Entering 'third_party/nlohmann' 2024-11-01T17:37:51.4914865Z Entering 'third_party/onnx' 2024-11-01T17:37:51.4973117Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T17:37:51.5025033Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T17:37:51.5073761Z Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T17:37:51.5119059Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T17:37:51.5158892Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T17:37:51.5197987Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T17:37:51.5237929Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T17:37:51.5276347Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T17:37:51.5315020Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T17:37:51.5355459Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T17:37:51.5402559Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T17:37:51.5445624Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T17:37:51.5507732Z Entering 'third_party/pocketfft' 2024-11-01T17:37:51.5548232Z Entering 'third_party/protobuf' 2024-11-01T17:37:51.5596032Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T17:37:51.5635326Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T17:37:51.5678849Z Entering 'third_party/psimd' 
2024-11-01T17:37:51.5729962Z Entering 'third_party/pthreadpool' 2024-11-01T17:37:51.5776400Z Entering 'third_party/pybind11' 2024-11-01T17:37:51.5816691Z Entering 'third_party/python-peachpy' 2024-11-01T17:37:51.5861861Z Entering 'third_party/sleef' 2024-11-01T17:37:51.5902110Z Entering 'third_party/tensorpipe' 2024-11-01T17:37:51.5945554Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T17:37:51.5990288Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T17:37:51.6031791Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T17:37:51.6075471Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T17:37:51.6121876Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T17:37:51.6198452Z [command]/usr/bin/git config --local --name-only --get-regexp http\.https\:\/\/github\.com\/\.extraheader 2024-11-01T17:37:51.6294781Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6297417Z [command]/usr/bin/git config --local --unset-all http.https://github.com/.extraheader 2024-11-01T17:37:51.6312415Z [command]/usr/bin/git submodule foreach --recursive sh -c "git config --local --name-only --get-regexp 'http\.https\:\/\/github\.com\/\.extraheader' && git config --local --unset-all 'http.https://github.com/.extraheader' || :" 2024-11-01T17:37:51.6605785Z Entering 'android/libs/fbjni' 2024-11-01T17:37:51.6630799Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6674289Z Entering 'third_party/FP16' 2024-11-01T17:37:51.6704599Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6733951Z Entering 'third_party/FXdiv' 2024-11-01T17:37:51.6756927Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6786947Z Entering 'third_party/NNPACK' 2024-11-01T17:37:51.6809870Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6843216Z Entering 'third_party/NVTX' 2024-11-01T17:37:51.6869627Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6904620Z Entering 
'third_party/VulkanMemoryAllocator' 2024-11-01T17:37:51.6923261Z http.https://github.com/.extraheader 2024-11-01T17:37:51.6964471Z Entering 'third_party/XNNPACK' 2024-11-01T17:37:51.6985704Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7036127Z Entering 'third_party/benchmark' 2024-11-01T17:37:51.7058297Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7248326Z Entering 'third_party/composable_kernel' 2024-11-01T17:37:51.7274159Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7310878Z Entering 'third_party/cpp-httplib' 2024-11-01T17:37:51.7333811Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7365152Z Entering 'third_party/cpuinfo' 2024-11-01T17:37:51.7386178Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7425745Z Entering 'third_party/cudnn_frontend' 2024-11-01T17:37:51.7447594Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7481906Z Entering 'third_party/cutlass' 2024-11-01T17:37:51.7509348Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7551376Z Entering 'third_party/eigen' 2024-11-01T17:37:51.7572303Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7612282Z Entering 'third_party/fbgemm' 2024-11-01T17:37:51.7639577Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7670312Z Entering 'third_party/fbgemm/third_party/asmjit' 2024-11-01T17:37:51.7703753Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7737771Z Entering 'third_party/fbgemm/third_party/cpuinfo' 2024-11-01T17:37:51.7760888Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7800796Z Entering 'third_party/fbgemm/third_party/cutlass' 2024-11-01T17:37:51.7827094Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7865015Z Entering 'third_party/fbgemm/third_party/googletest' 2024-11-01T17:37:51.7883756Z http.https://github.com/.extraheader 2024-11-01T17:37:51.7921687Z Entering 'third_party/fbgemm/third_party/hipify_torch' 2024-11-01T17:37:51.7945627Z http.https://github.com/.extraheader 
2024-11-01T17:37:51.7979031Z Entering 'third_party/flatbuffers' 2024-11-01T17:37:51.8002772Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8041853Z Entering 'third_party/fmt' 2024-11-01T17:37:51.8062599Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8094134Z Entering 'third_party/gemmlowp/gemmlowp' 2024-11-01T17:37:51.8117635Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8147915Z Entering 'third_party/gloo' 2024-11-01T17:37:51.8171301Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8207662Z Entering 'third_party/googletest' 2024-11-01T17:37:51.8228891Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8265914Z Entering 'third_party/ideep' 2024-11-01T17:37:51.8287252Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8316666Z Entering 'third_party/ideep/mkl-dnn' 2024-11-01T17:37:51.8347345Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8389812Z Entering 'third_party/ittapi' 2024-11-01T17:37:51.8411903Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8448244Z Entering 'third_party/kineto' 2024-11-01T17:37:51.8469653Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8512069Z Entering 'third_party/kineto/libkineto/third_party/dynolog' 2024-11-01T17:37:51.8538967Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8572318Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/DCGM' 2024-11-01T17:37:51.8594759Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8629983Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/cpr' 2024-11-01T17:37:51.8650749Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8687398Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/fmt' 2024-11-01T17:37:51.8708178Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8740838Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags' 2024-11-01T17:37:51.8766931Z http.https://github.com/.extraheader 
2024-11-01T17:37:51.8795608Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/gflags/doc' 2024-11-01T17:37:51.8815839Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8854432Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/glog' 2024-11-01T17:37:51.8877284Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8910725Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/googletest' 2024-11-01T17:37:51.8931851Z http.https://github.com/.extraheader 2024-11-01T17:37:51.8963399Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/json' 2024-11-01T17:37:51.8983016Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9024017Z Entering 'third_party/kineto/libkineto/third_party/dynolog/third_party/pfs' 2024-11-01T17:37:51.9044655Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9081341Z Entering 'third_party/kineto/libkineto/third_party/fmt' 2024-11-01T17:37:51.9105207Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9146733Z Entering 'third_party/kineto/libkineto/third_party/googletest' 2024-11-01T17:37:51.9167007Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9201133Z Entering 'third_party/mimalloc' 2024-11-01T17:37:51.9227686Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9257975Z Entering 'third_party/nccl/nccl' 2024-11-01T17:37:51.9280932Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9316449Z Entering 'third_party/nlohmann' 2024-11-01T17:37:51.9338697Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9368717Z Entering 'third_party/onnx' 2024-11-01T17:37:51.9389217Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9431809Z Entering 'third_party/onnx/third_party/pybind11' 2024-11-01T17:37:51.9460037Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9496282Z Entering 'third_party/opentelemetry-cpp' 2024-11-01T17:37:51.9517560Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9554932Z 
Entering 'third_party/opentelemetry-cpp/third_party/benchmark' 2024-11-01T17:37:51.9579392Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9624184Z Entering 'third_party/opentelemetry-cpp/third_party/googletest' 2024-11-01T17:37:51.9644301Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9678249Z Entering 'third_party/opentelemetry-cpp/third_party/ms-gsl' 2024-11-01T17:37:51.9698981Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9733434Z Entering 'third_party/opentelemetry-cpp/third_party/nlohmann-json' 2024-11-01T17:37:51.9755644Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9788687Z Entering 'third_party/opentelemetry-cpp/third_party/opentelemetry-proto' 2024-11-01T17:37:51.9808743Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9846130Z Entering 'third_party/opentelemetry-cpp/third_party/opentracing-cpp' 2024-11-01T17:37:51.9866281Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9897938Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp' 2024-11-01T17:37:51.9922432Z http.https://github.com/.extraheader 2024-11-01T17:37:51.9961299Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/civetweb' 2024-11-01T17:37:51.9991093Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0027084Z Entering 'third_party/opentelemetry-cpp/third_party/prometheus-cpp/3rdparty/googletest' 2024-11-01T17:37:52.0047802Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0086684Z Entering 'third_party/opentelemetry-cpp/tools/vcpkg' 2024-11-01T17:37:52.0112177Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0166213Z Entering 'third_party/pocketfft' 2024-11-01T17:37:52.0194033Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0230275Z Entering 'third_party/protobuf' 2024-11-01T17:37:52.0254097Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0294129Z Entering 'third_party/protobuf/third_party/benchmark' 2024-11-01T17:37:52.0315822Z 
http.https://github.com/.extraheader 2024-11-01T17:37:52.0355158Z Entering 'third_party/protobuf/third_party/googletest' 2024-11-01T17:37:52.0375044Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0415189Z Entering 'third_party/psimd' 2024-11-01T17:37:52.0445067Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0481735Z Entering 'third_party/pthreadpool' 2024-11-01T17:37:52.0505836Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0538364Z Entering 'third_party/pybind11' 2024-11-01T17:37:52.0561164Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0597766Z Entering 'third_party/python-peachpy' 2024-11-01T17:37:52.0618915Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0654509Z Entering 'third_party/sleef' 2024-11-01T17:37:52.0675193Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0709656Z Entering 'third_party/tensorpipe' 2024-11-01T17:37:52.0738660Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0776189Z Entering 'third_party/tensorpipe/third_party/googletest' 2024-11-01T17:37:52.0802443Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0838547Z Entering 'third_party/tensorpipe/third_party/libnop' 2024-11-01T17:37:52.0858489Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0890165Z Entering 'third_party/tensorpipe/third_party/libuv' 2024-11-01T17:37:52.0912549Z http.https://github.com/.extraheader 2024-11-01T17:37:52.0952842Z Entering 'third_party/tensorpipe/third_party/pybind11' 2024-11-01T17:37:52.0973059Z http.https://github.com/.extraheader 2024-11-01T17:37:52.1006480Z Entering 'third_party/tensorpipe/third_party/pybind11/tools/clang' 2024-11-01T17:37:52.1031717Z http.https://github.com/.extraheader 2024-11-01T17:37:52.1368753Z Cleaning up orphan processes